Journal of Experimental Psychology

Arthur W. Melton, Editor
Air Force Personnel and Training Research Center

David A. Grant, Associate Editor, University of Wisconsin
Delos D. Wickens, Associate Editor, Ohio State University

Consulting Editors
E. James Archer, University of Wisconsin
Judson S. Brown, State University of Iowa
Cletus J. Burke, Indiana University
Paul M. Fitts, Ohio State University
Frederick C. Frick, Massachusetts Institute of Technology
Frank A. Geldard, University of Virginia
James J. Gibson, Cornell University
Clarence H. Graham, Columbia University
Harold W. Hake, University of Illinois
Lloyd G. Humphreys, AF Personnel and Training Research Center
Arthur L. Irion, Tulane University
Howard H. Kendler, New York University
Donald B. Lindsley, University of California, Los Angeles
Kenneth MacCorquodale, University of Minnesota
Quinn McNemar, Stanford University
Neal E. Miller, Yale University
Kenneth W. Spence, State University of Iowa

Arthur C. Hoffman, Managing Editor
Helen Orr, Assistant Managing Editor
Editorial Staff: Sadie J. Doyle, Frances H. Clark, Barbara Cummings





CONTENTS 


Effects of Course Frequency and Aided Time Constant on Pursuit and Compensatory
Tracking: R. Chernikoff and F. V. Taylor 285


Proactive Interference and Facilitation as a Function of Amount of Training and Stress:
D. S. Palermo 293

Serial Effects in Auditory Threshold Judgments: W. J. McG 
The Abstraction and Conceptualization of Form, Color, and Number: J. F. Wohlwill ... 304
The Effect of Drive Reversal on Latency, so ony Teng tase cael a 

Response Strength as a Function of Delay of Reward in a Run 
W. B. Holder, M. H. Marx, E. E. and G. Collier 316

Factors Contributing to General Versus Specific Perceptual Learning:

J. A. Lzonarp, H. W. Crarxe, anv S. R. Staats 324 

Application of the Half-Split Technique to Problem-Solving Tasks:
R. A. Goldbeck, B. B. Bernstein, W. A. Hillix, and M. H. Marx 330


The Relation of Eye Movements During Sleep to Dream Activity: An Objective Method
for the Study of Dreaming: W. Dement and N. Kleitman 339


Effect of Irrelevant Thirst Motivation on a Response Learned with Food Reward: 
G. R. Grice and J. D. Davis 347

Prediction From and Interaction Among Multiple Concurrent Discriminative Responses 
Cc. W. oe 353 
Supplementary Report: Response Availability in Human Trial-and-Error me 4 
Nosie 359 





American Psychological Association 
Vol. 53 No. 5 May 1957 














EFFECTS OF COURSE FREQUENCY AND AIDED TIME 
CONSTANT ON PURSUIT AND COMPENSATORY 
TRACKING 


RUBE CHERNIKOFF AND FRANKLIN V. TAYLOR¹
Naval Research Laboratory 


Previous studies (2, 3, 6, 9) have 
shown that independent variations in 
display mode (pursuit vs. compensa- 
tory), aided tracking time constant, 
and frequency of the target course 
input have important effects upon 
tracking performance. The purpose 
of this experiment is to study the 
interaction among these parameters. 

In pursuit tracking, S is required 
to position a control marker to coin- 
cide with an independently driven 
target course marker. With com- 
pensatory tracking, S’s task is to keep 
a moving marker which is positioned 
by the difference between the target 
input and the control output, aligned 
with a reference marker fixed at the 
center of the display. 
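The distinction between the two display modes can be illustrated with a minimal sketch. The function names and the dictionary layout are hypothetical and are not taken from the apparatus; the only relation assumed is the one stated above, namely that the compensatory marker is positioned by the difference between target input and control output.

```python
def pursuit_display(target, control_output):
    """Pursuit mode: both the target marker and the control marker move."""
    return {"target_marker": target, "control_marker": control_output}


def compensatory_display(target, control_output):
    """Compensatory mode: a fixed reference at center (0) and one moving
    marker positioned by the difference between target and control output."""
    return {"reference_marker": 0.0, "error_marker": target - control_output}


def visible_error(display):
    """Separation between the two markers, whichever mode produced them."""
    if "error_marker" in display:
        return abs(display["error_marker"] - display["reference_marker"])
    return abs(display["target_marker"] - display["control_marker"])
```

Both modes show the same error, but two different target and control combinations with the same difference are indistinguishable on the compensatory display, whereas the pursuit display keeps them apart.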

The simplest form of aided tracking 
obtains when a movement of the 
control causes a proportional change 
in the position of the control marker, 
and simultaneously introduces a 
change in marker velocity as well. 
The ratio of the position sensitivity
to the velocity sensitivity is termed
the aiding time constant. This constant
is computed as follows: Aiding
time constant (in sec.) = marker
displacement per degree of control
displacement (in inches per degree)
divided by marker velocity per degree
of control displacement (in inches per
sec. per degree).

¹ The authors are indebted to Mr. Henry P.
Birmingham for his assistance in the equipment
design and for the many suggestions made
during the formulation of the problem.

In a Foxboro Company report (4) 
it was first suggested that aided 
tracking be considered the general 
case, with position tracking one 
extreme special case containing no 
velocity component, and velocity (or 
rate) tracking the other extreme, in 
which no position component is pres- 
ent. By thus considering tracking 
control systems on an aiding con- 
tinuum, position control is defined as 
having an aiding time constant of 
infinity (∞), and rate control as
having a time constant of zero. (The 
value of all time constants in this 
report is in seconds.) These two 
limiting time constants were employed 
in this experiment along with an 
intermediate value of .5 sec. The 
latter was chosen because it is fre- 
quently taken by engineers to repre- 
sent an optimum value for practical 
gun fire control systems. Tracking
performance with these three time
constants for both pursuit and com- 
pensatory modes was compared on 
three different target courses, com- 
posed of frequencies chosen to cover 
a wide range of tracking difficulty. 


METHOD

Apparatus

Display.—A 5-in. cathode ray tube (CRT)
with a P-11 short-persistence screen was used
to display the tracking information. By means 
of an electronic switch, the CRT beam was 
time-shared to produce two spots of light. One 
spot was shaped into a }-in. vertical line, while 
the other was left as a x-in. diameter dot. 
During pursuit tracking, the vertical line was 
controlled by S and the dot positioned by a 
course generator. When compensatory tracking 
was used, the dot was fixed in the center of the 
CRT face as the reference, while the vertical line 
was driven by the difference between the course 
generator output and S’s control movements. 

Control.—The S positioned the vertical line 
marker by left-right movements of a horizon- 
tally mounted joy stick. This stick consisted of 
a flexible steel bar, } in. thick, § in. wide, and 
24 in. long, with a Statham strain gauge attached 
to the bar to provide a noise-free pick-off of 
control stick motion. With position tracking 
(∞), the joy stick-to-marker sensitivity was set
so that a lateral stick movement of $ in. (by a 
force application of 4 oz.) resulted in a 1-in. 
displacement of the marker on the CRT face. 
(Stick displacement is given in inches because 
of the difficulty in specifying the angular dis- 
placement of the flexible control.) With rate 
tracking (0) a stick displacement of 4 in. pro- 
duced a marker velocity of 2 in./sec. With .5 
aiding the relationship between the joy stick and 
marker was such that a 4-in. stick displacement 
produced both a position change of 1 in. and a 
velocity change of 2 in./sec. 
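The relation between the control sensitivities and the aiding time constant may be sketched as follows. This is an illustration only: the function names are hypothetical, stick displacement is expressed in arbitrary normalized units (one unit being whatever displacement yields the stated 1-in. position change or 2-in./sec. velocity change), and rectangular integration is an assumption made for simplicity.

```python
import math


def aiding_time_constant(position_gain, velocity_gain):
    """Aiding time constant (sec.): position gain (in. per unit stick
    displacement) divided by velocity gain (in./sec. per unit)."""
    if velocity_gain == 0:
        return math.inf  # pure position control
    return position_gain / velocity_gain


def marker_response(stick, position_gain, velocity_gain, dt):
    """Marker position for a sampled stick record:
    position_gain * stick + velocity_gain * (running integral of stick)."""
    integral = 0.0
    positions = []
    for u in stick:
        integral += u * dt  # rectangular integration (an assumption)
        positions.append(position_gain * u + velocity_gain * integral)
    return positions


# The three control conditions, in normalized stick units.
position_tc = aiding_time_constant(1.0, 0.0)  # position tracking
rate_tc = aiding_time_constant(0.0, 2.0)      # rate tracking
aided_tc = aiding_time_constant(1.0, 2.0)     # 1 in. and 2 in./sec. together
```

With a position gain of 1 in. and a velocity gain of 2 in./sec. per unit of stick displacement, the ratio is .5 sec., which agrees with the intermediate condition described above.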

Courses.—Three courses, each consisting of a 
complex of three sine waves, were used in the 
experiments. The frequencies comprising each
of the courses are listed in Table 1. For all 
courses, the relative amplitude of each sine wave 
was inversely proportional to its frequency. 
This provided an equal maximum rate for each 
frequency of the course. As displayed, all three 
courses were of the same amplitude and the 
maximum excursion of the marker was across 
the center 4 in. of the CRT face. 
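The statement that amplitudes inversely proportional to frequency give each component the same maximum rate can be checked with a short sketch. The function name, the zero starting phases, and the grouping of the three example frequencies into a single course are assumptions made for illustration; the text does not specify them, and no scaling to the displayed course amplitude is attempted.

```python
import math


def make_course(freqs_cpm, peak_rate=1.0):
    """Build a course as a sum of sine waves whose amplitudes are inversely
    proportional to frequency, so each component has the same peak rate.

    freqs_cpm: component frequencies in cycles per minute.
    Returns (amplitudes, course), where course(t) gives position at t sec.
    """
    omegas = [2.0 * math.pi * f / 60.0 for f in freqs_cpm]  # rad/sec
    amps = [peak_rate / w for w in omegas]                  # A = k / frequency

    def course(t):
        # Zero starting phase for every component is an assumption.
        return sum(a * math.sin(w * t) for a, w in zip(amps, omegas))

    return amps, course


# Illustrative frequencies only (4 4/9, 11 1/9, and 17 7/9 cycles per minute).
example_freqs = [40.0 / 9.0, 100.0 / 9.0, 160.0 / 9.0]
amps, course = make_course(example_freqs)
```

The peak rate of a component A sin(wt) is A times w; with A set to k/w this product equals k for every component, which is the equal-maximum-rate property described above.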

TABLE 1

Sine Wave Frequencies Comprising Each of the Courses

Frequencies (Cycles per Minute)

4 4/9
11 1/9
17 7/9
26 2/3
10 2/3

Scoring.—Since an AC voltage, which is
proportional in amplitude to the separation
between the spots on the CRT face, exists at the
output of the electronic switch, it was possible
to obtain directly a measure of spot separation.
By use of an electronic integrator, this error
voltage was rectified and then integrated over
the scoring duration by cumulating a charge on
a condenser. Integrated error was read directly
on a voltmeter.
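A digital analogue of the scoring procedure, rectification followed by integration over the scored portion of the trial, might look like the sketch below. The function name, the sampled representation, and the default trial and scoring durations (taken from the 1-min. trial with the last 45 sec. scored) are illustrative assumptions; the original score was an analogue voltage in arbitrary units.

```python
def integrated_error(target, marker, dt, trial_sec=60.0, scored_sec=45.0):
    """Rectify the target-minus-marker separation and integrate it over the
    last `scored_sec` seconds of a trial sampled every `dt` seconds."""
    start = int(round((trial_sec - scored_sec) / dt))  # first scored sample
    rectified = (abs(t - m) for t, m in zip(target[start:], marker[start:]))
    return sum(rectified) * dt
```

Errors made during the unscored opening portion of the trial do not contribute to the score.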


Procedure 


The S sat facing the CRT display at a viewing 
distance of approximately 12 in. A fluorescent 
desk lamp was placed behind the CRT to provide 
room illumination with a minimum of reflection 
from the CRT face. The joy stick was mounted
10 in. to the right of the CRT centerline and 2 
in. above the height of S’s arm rest. 

Before the experimental sessions were begun, 
two training sessions consisting of four practice 
trials per condition were given to familiarize S 
with the equipment. No attempt was made to 
train S to respond in any specific manner to the 
various conditions. For pursuit tracking, S was 
instructed to move the joy stick to the left or 
right so as to keep the vertical line marker on 
the moving course dot, while for compensatory 
he was instructed to keep the vertical line marker 
on the fixed reference dot. 

Three experiments were run utilizing the same 
design but differing in the course frequencies 
which comprised the tracking problem. The 
order in which the courses were run was as 
follows: Exp. I, Course B; Exp. II, Course A; 
and Exp. III, Course C. Eighteen Navy en- 
listed men served as Ss, six in each experiment. 
The six conditions, ∞-compensatory (∞-C),
0-compensatory (0-C), .5-compensatory (.5-C),
∞-pursuit (∞-P), 0-pursuit (0-P), and
.5-pursuit (.5-P), were presented in a balanced
latin-square design in blocks of three trials per 
condition for 18 daily sessions. One practice 
trial always preceded each block of scored trials. 
A trial was 1 min. in duration, with the last 45 
sec. scored by the error integrator. To reduce
fatigue effects from an overly long tracking 
period, the six conditions were presented in two 
sittings of three conditions each. The interval 
between successive sittings was approximately 


4 hr. 
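One way to order six conditions in a balanced latin square is sketched below. The text does not say which construction was used, so the Williams-type construction here, the function name, and the assignment of one row to each of the six Ss are assumptions offered only as an illustration.

```python
def balanced_latin_square(n):
    """Williams-type square for an even number of conditions: each condition
    occupies each ordinal position once, and each condition immediately
    follows every other condition exactly once."""
    if n % 2:
        raise ValueError("this construction needs an even number of conditions")
    first = [0]
    low, high = 1, n - 1
    for k in range(1, n):
        if k % 2:          # odd steps take the next low index
            first.append(low)
            low += 1
        else:              # even steps take the next high index
            first.append(high)
            high -= 1
    # Remaining rows are cyclic shifts of the first row.
    return [[(c + r) % n for c in first] for r in range(n)]


conditions = ["∞-C", "0-C", ".5-C", "∞-P", "0-P", ".5-P"]
square = balanced_latin_square(len(conditions))
orders = [[conditions[i] for i in row] for row in square]  # one row per S
```

Each row gives one S's order of conditions within a session; balancing immediate succession in this way guards against carry-over from one condition to the next.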





Fig. 1. Tracking performance data showing interaction between aiding time constants
and course frequency. (Panels: pursuit and compensatory; ordinate: integrated error,
arbitrary units.)


The 1-min. course duration and 45-sec.
scoring interval were automatically programmed.
After each trial, E recorded the integrated-error
score and allowed a 30-sec. rest interval before
starting the next trial.


RESULTS AND DISCUSSION


Since the main interest in this 
study was comparative tracking per- 
formance after S had reached a fairly 
stable level, the statistical analyses 
and graphical representations are 
based on the data collected during 
the last four days of each experiment 
(Sessions 15-18). Figure 1 presents 
a graph of the tracking performance 
data showing the interaction between 
the aiding time constants and the 
course difficulty levels. Application 
of Bartlett’s test to the data for each 
of the three experiments indicated a 
lack of homogeneity of variance for 
the measures of one of the experi- 
ments (Course A). Therefore, sig- 
nificance of differences between the 
various conditions was determined by 
using Wilcoxon’s (10) ranking procedure
for paired replicates. Comparisons
between conditions were
based on the data for 6 Ss, replicated
over the last four sessions. The
results of the nonparametric T tests
are shown in Table 2.
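The T statistic of a ranking procedure for paired replicates can be computed as in the following sketch. It follows the usual signed-rank convention (zero differences dropped, tied absolute differences given their mean rank, T taken as the smaller of the two rank sums); that convention, the function name, and the example scores are assumptions, since the text does not give the computational details or the raw data.

```python
def wilcoxon_t(x, y):
    """Return (T, n) for paired scores x and y, where T is the smaller of the
    positive and negative rank sums and n is the number of nonzero pairs."""
    diffs = sorted((a - b for a, b in zip(x, y) if a != b), key=abs)
    ranks = [0.0] * len(diffs)
    i = 0
    while i < len(diffs):
        j = i
        # Extend j over a run of tied absolute differences.
        while j + 1 < len(diffs) and abs(diffs[j + 1]) == abs(diffs[i]):
            j += 1
        mean_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[k] = mean_rank
        i = j + 1
    positive = sum(r for r, d in zip(ranks, diffs) if d > 0)
    negative = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return min(positive, negative), len(diffs)


# Hypothetical integrated-error scores for two conditions (not from the study).
condition_a = [4.5, 2.5, 6.0, 7.0, 3.0, 5.5]
condition_b = [3.0, 3.0, 4.0, 4.0, 4.0, 3.0]
t_value, n_pairs = wilcoxon_t(condition_a, condition_b)
```

The resulting T is then referred to a table of critical values for the given number of pairs to obtain the significance level.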

In Fig. 2 the data are plotted so as to
afford a direct comparison of tracking
performance on pursuit and compensatory
displays. Three results are apparent from
the data: (a) From Fig. 1, tracking error with
both display modes in association
with all three time constants increases
rapidly, with accompanying
shifts in the relative proficiency of
the various time constants, as the
target course frequencies are increased.
(b) Also from Fig. 1, with
both pursuit and compensatory tracking,
the optimum aided tracking time
constant shifts upward with increasing
target course frequencies. However,
for the total range of frequencies
covered, the best single time constant
of those tested was .5. (c) From
Fig. 2, pursuit tracking with a time
constant of ∞ is superior to compensatory
at all frequencies. The pursuit
mode is also superior to the
compensatory with time constants of
0 and .5 at the high and medium
frequencies, but with these two time
constants at the lowest frequencies,
compensatory tracking becomes superior
to pursuit.

TABLE 2

Significance Levels for Comparisons Between Conditions Using
Nonparametric T Tests

Comparisons | Course A | Course B | Course C

0-P vs. ...
.01 .01 .01 .01 .01 .05 .01 .01 .01

Fig. 2. Comparison of tracking performance
on pursuit and compensatory displays as a
function of aiding time constants and course
frequency. (Abscissa: aided time constant, sec.;
ordinate: integrated error, arbitrary units.)


The fact that tracking error increases 
as the frequencies in the target course are
raised is not new, since many other
studies have shown similar results. 
However, one of the main contributions 
of the present experiment is the demon- 
stration that the accuracy of systems 
embodying different time constants falls 
at different rates with increased input 
frequencies. It is, indeed, this fact 
which produces the upward shift with 
frequency in the optimum time constant 
which is noted above as the second 
finding. 

In order to account for the shift in the 
optimum time constant, it is necessary 
to consider the purpose served by aiding 
in a man-machine system. As was ex- 
plained earlier, the simplest form of 
aiding is achieved by inserting into the 
control mechanism some type of velocity 
generating device in such a way that 
when S applies force to his control he
changes not only the position of the
controlled marker but also the velocity
at which the marker is being driven.
The velocity and the position terms play 
different roles in the aiding process. 

The velocity component has several
different effects, with the three most
important being (a) the “unburdening”
of S through making it unnecessary for
him to track the velocities in the target
course (1), (b) the introduction of a time
lag into the control system, and (c) the
increased complexity of S’s pattern of
force application in making quick step
function positional corrections. This
last factor is felt to be most important when
high frequency courses are being tracked.
Under such conditions S’s ability to
follow continuously is taxed to the point
where the tracking problem begins to
resemble that posed by a course composed
of a succession of step function
changes. In regard to the matter of
complexity, S, when using pure position
control, needs only to apply a force and
maintain it in order to move the controlled
quantity from one stationary
position to another. However, with
velocity control, the same correction calls
for an application of force followed by
its removal.

In general, the unburdening effect of
the velocity generating device assists S,
while the increased complexity tends to
work against him. The time lag probably
exerts two effects which act in opposite
directions. On the one hand, the
lag may benefit performance, at least at 
low frequencies, as a result of filtering 
out or damping S’s high-frequency 
“noise” (8). At the same time, it 
delays S’s knowledge of the results of 
his corrective actions and thus tends to 
degrade performance. However, this 
detrimental effect of the time delay is to 
a great extent overcome in an aided 
tracking device by the position com- 
ponent. For when tracking with a 
spring-restrained joy stick, the position 
term represents directly the force output 
of S and thereby serves to give him 
completely up-to-date visual information 
about what he is doing with his hand. 
Aided tracking, then, provides a complex
compromise of factors, some of
which assist S by reducing his task while
others act in an opposing fashion to make
his task more difficult. It may be suggested
that the shift in optimum aided
tracking time constant observed in the
experiment is to be accounted for by a
change in the balance of these factors as
the input frequencies are altered. Thus,
at very low frequencies, where the major 
portion of the tracking task consists of 
matching the slowly changing velocity of 
the target, the beneficial aspects of the 
velocity term might be expected to pre- 
dominate over its deleterious effects. 
This would suggest that the velocity 
term should assume a large value and 
hence a low time constant would be 
called for. On the other hand, with high 
target frequencies, the velocities are 
changing with great rapidity so that very 
little benefit can be anticipated from 
unburdening and damping, whereas the 
disruptive effects of the velocity com- 
ponent now exert their full force. This 
would mean that a high valued time 
constant would be required for the 
precise tracking of target courses in 
which high frequencies predominate. 
Finally, with intermediate target fre- 
quencies, one would anticipate that the 
greatest benefit would be obtained by 
balancing the help furnished by each of 
the two terms through providing appreciable
values of both. The aided
tracking time constant of .5 sec. would
thereby be expected to be superior to
the other two values tested.

The direction of the shift in the
optimum time constant observed in the
experiment is consonant with this explanation.
With the high-frequency target,
∞ was clearly the best, whereas .5
gave the lowest error scores at the intermediate
frequency, although it did not
differ significantly from ∞ at this point.
At the low target frequency, where it 
would be expected that 0 would be 
optimum, the 0- and .5-sec. constants did 
not differ from one another with the 
compensatory mode, and with pursuit, 
.5 sec. was associated with the best 
tracking at this frequency. However, 
from a consideration of the slopes of the 
curves, it would appear that with lower 
frequencies than those employed in this 
experiment, the 0- and .5-sec. curves 
would have crossed as predicted. 

A clearer way of noting the trend is to
consider the data for the time constant
of ∞. At the high end it is the best
value, at the middle frequency it drops
to a position intermediate between 0 and
.5 (it is significantly different only from
0) and with the low-frequency target
course it is the worst of the three constants.

The findings suggest not only that it
would be desirable to extend the tests to
a lower frequency than any utilized in
the experiment, but also that the medium
course frequencies which were employed
may have been too high. With a somewhat
lower intermediate frequency, .5
sec. might then become significantly
superior to the other two values.


The results lead to the practical suggestion 
that if control or tracking devices are to be built 
to work within fairly narrow frequency ranges, 
time constants should be inserted which are 
appropriate to the expected range, e.g., high 
constants for high frequencies and low constants 
for low frequencies. However, if it is known in 
advance that the system must perform accept- 
ably over a very wide range of input frequencies, 
the single best constant to be recommended is 
one at or near .5 sec. Better still would be a 
method of appropriately shifting the aided 
tracking time constant as the input frequencies 
changed. 







The third result of the experiment is 
that pursuit tracking is superior to 
compensatory at medium and high fre- 
quencies for all three time constants, but 
that the pursuit mode loses its advantage 
at the low frequencies when a velocity 
term is incorporated in the tracking 
system. 

It is entirely likely that the general 
superiority of the pursuit tracking mode 
over the compensatory arises from the 
additional information presented to S by 
the pursuit display. The compensatory 
indication presents directly to S only
visual error in terms of direction and
amplitude. No immediate inference as
to the nature of the target course is
permitted since motions of the indicator
due to the target input and movements
due to S's corrections are mutually
contaminated through being mixed together.
This contamination becomes
more and more troublesome as the frequencies
of the target are raised.

In contrast to this “single channel”
indication, the pursuit display presents
three channels of information rather than
only one. Visual error is here shown as
the distance between the target course
marker and the indicator which represents
the response of the system to S’s
force applications. By means of this
one display, information is provided to
S which permits him to take account of
the properties of the target and the
tracking system independently of one
another, as well as the nature of the
error.

From these considerations, it is not 
difficult to account for the superiority of 
pursuit over compensatory tracking 
found for the majority of test points 
employed in the experiment. It is 
important to add, however, that this 
obtained result does not take account of 
the fact that the intrinsic properties of a 
compensatory display permit its sensi- 
tivity to be increased considerably, 
whereas this is not usually true of the 
pursuit mode. In the present experi- 
ment this potential advantage of com- 
pensatory tracking was not exploited, as 
it could be in engineering practice, and 
therefore the study does not furnish a
direct comparison of the two modes with
both fully optimized. In view of this, 
it is particularly striking to note that 
for the lowest target course frequencies, 
the superiority of the pursuit display 
disappears for aiding ratios of .5 and 0 
and that compensatory tracking becomes 
the more accurate. Yet this finding is 
quite understandable when the effects of 
aiding discussed next are considered. 

With low target frequencies it becomes 
possible, through the use of the velocity 
component, to so completely unburden 
S that his task becomes extremely simple. 
When thus unburdened, a close approach 
is made to what has been called
“stimulus-response integrity” (1).
Under this condition, the information 
contained in the instantaneous ampli- 
tude value of the visual error is sufficient 
to specify completely the response which 
will reduce this value to zero. Thus, if 
perfect stimulus-response integrity could 
be achieved, an invariant and propor- 
tional relationship would be found to 
obtain between error amplitude and the 
required S response. The S would not 
have to take account of target velocities 
or error velocities nor would he have to 
predict in any fashion. 

Although perfect stimulus-response in- 
tegrity cannot be achieved fully in 
practice, it can be approached so closely 
with low-frequency inputs that S, with 
his inherent perceptual and motor limi- 
tations, can do his best tracking only by 
attending fully to error amplitude and 
by rejecting all other information. It 
seems possible, therefore, to account for 
the superiority of compensatory tracking 
over pursuit at time constants of .5 and 
0 with the lowest target frequencies on 
the basis of the fact that it is easier to 
give full attention to error on the com- 
pensatory than on the pursuit display. 

The findings of this part of the study 
lead to the recommendation that a 
compensatory, rather than a pursuit, 
display should be utilized in practical 
control systems intended for following 
low-frequency inputs if a velocity term 
is involved. The marked superiority of 
pursuit tracking over compensatory at 
the higher frequencies might suggest
that, where possible, the pursuit mode
should be employed in control devices 
intended for following high-frequency 
inputs. However, since no effort was 
made in the experiment to optimize the 
sensitivity of the compensatory display, 
the possibility is not precluded that 
compensatory tracking of high-frequency 
targets could be made to be much more 
precise than is shown in this study. 

In view of the results of the present 
experiment, it may be suggested that 
some of the discrepancies in the tracking 
literature may have arisen as a result of 
various investigators employing target 
courses of different frequency ranges and 
distributions. Much of the early engi- 
neering work on aided tracking, which 
showed improvement in tracking ac- 
curacy with the introduction of the 
velocity term, was carried out with 
actual or simulated gun-fire control 
target courses which consist predomi- 
nantly of low-frequency components. 
In contrast to this, the tracking research 
carried out in psychological laboratories 
has typically employed target courses of 
relatively high frequency, against which 
a velocity component in the tracking 
system is ineffective or actually dis- 
ruptive. 

Thus, the findings of Lincoln and 
Smith (5) that pure position tracking is 
superior to tracking with time constants 
of .5 and 0 sec., may be accounted for if
it is assumed that they employed rela- 
tively high target frequencies comparable 
to our Course C. Although the fre- 
quency distribution of their course is not 
specified, the general description of their 
cam suggests that this might be the case. 

In the present study, course difficulty 
was altered by changing the frequencies 
in the course while holding the amplitude 
constant. That a different result would 
have been obtained if the difficulty had 
been changed by altering the amplitude 
while holding frequency constant is 
indicated by an experiment by Pearl, 
Simon, and Smith (7). Working with 
a pursuit display, they compared track- 
ing precision with time constants of .25, 
.5 and 1.0 sec. at three levels of course 
difficulty. These stages of difficulty
were achieved by altering course speed
and amplitude together while holding 
the frequency components of the course 
unchanged. Although their results are 
somewhat difficult to interpret, there is 
an indication in their data of a downward 
shift in the optimum time constant with 
an increase in course difficulty. This 
provides an interesting contrast to the 
present results wherein the optimum 
shifted upward with increased difficulty 
brought about by upping the target 
course frequencies. 

A previous study (3) comparing pur- 
suit and compensatory tracking with a 
control system including two integrators 
showed no difference between the two 
modes on a course containing frequencies 
lower than the present study’s Course A, 
in which compensatory was superior with 
time constants of 0 and .5. This ap- 
parent discrepancy in results between the 
two studies at the low frequencies may 
have been due to differences in the design 
and conduct of the experiments, the 
types of tracking controls, the sensitivities
of the scoring systems, or the
number of integrators in the control
systems. Further research may isolate
the important variables determining the 
precision of pursuit and compensatory 
tracking at low course frequencies. 


SUMMARY 


From previous studies, the independent vari- 
ations in display mode (pursuit vs. compensa- 
tory), aided tracking time constant, and fre- 
quency of the target course input were found to 
affect the precision of tracking performance. 
This experiment was designed to study the 
interaction among these parameters. 

Tracking performance with aiding time constants
of 0, .5, and ∞ for both the pursuit and
compensatory modes was compared on three 
target courses, composed of frequencies chosen 
to cover a wide range of tracking difficulty. 
Three studies were run, differing only in the 
course input used. In each study, Ss were given 
three trials daily for 18 sessions, on all six com- 
binations of time constants and display modes. 
A trial was 1 min. in duration, with the last 45 
sec. scored by an error integrator. Each course 
was composed of three sine waves; the display 
was a 5-in. cathode ray tube; and the control 
was a spring-centered joy stick. The following
results were obtained:


1. As the course frequencies increased, there 
was a rapid rise in tracking error, with accom- 
panying shifts in the relative proficiency of the 
various time constants. 

2. With both pursuit and compensatory 
tracking, the optimum aiding constant shifts 
upward with increasing course frequencies, al- 
though for the total range of frequencies covered 
the single best constant was .5. 

3. Pursuit tracking is superior to compen- 
satory with all time constants for the two courses 
of high and medium frequencies. However, the 
course composed of the lowest frequencies 
showed pursuit to be better than compensatory 
only at the time constant of ∞, whereas compensatory
was superior using 0 and .5 aiding
constants.

A theoretical analysis of the shifts in the 
proficiency of the aiding constants and the 
changes in effectiveness of the two display modes, 
as related to the change in course frequencies, 
is presented. 


REFERENCES 


1. Birmingham, H. P., & Taylor, F. V. A
human engineering approach to the design
of man-operated continuous control systems.
U.S. Naval Res. Lab. Rep. No.
4333, 1954.

2. Chernikoff, R., Birmingham, H. P., &
Taylor, F. V. A comparison of pursuit
and compensatory tracking under conditions
of aiding and no aiding. J. exp.
Psychol., 1955, 49, 55-59.

3. Chernikoff, R., Birmingham, H. P., &
Taylor, F. V. A comparison of pursuit
and compensatory tracking in a simulated
aircraft control loop. J. appl. Psychol.,
1956, 40, 47-52.

4. Foxboro Company. Studies in aided
tracking. Memorandum No. 25 to Div.
7, N.D.R.C., Foxboro Co., Foxboro,
Mass., 1945, 1-43.

5. Lincoln, R. S., & Smith, K. U. Systematic
analysis of factors determining accuracy
in visual tracking. Science, 1952, 116
(3008), 183-187.

6. Noble, M., Fitts, P. M., & Warren, C. E.
The frequency response of skilled subjects
in a pursuit tracking task. J. exp. Psychol.,
1955, 49, 249-256.

7. Pearl, B. E., Simon, J. R., & Smith, K. U.
Visual tracking: IV. Interrelations of
target speed and aided-tracking ratio in
defining tracking accuracy. J. appl.
Psychol., 1955, 39, 209-214.

8. Rockway, M. R. The effect of variations
in control-display ratio and exponential
time delay on tracking performance.
USAF, WADC Tech. Rep., 1954, No.
54-618.

9. Searle, L. V. Psychological studies of
tracking behavior. Part IV—The intermittency
hypothesis as a basis for predicting
optimum aided-tracking time
constants. U.S. Naval Res. Lab. Rep.
No. 3872, 1951.

10. Wilcoxon, F. Some rapid approximate
statistical procedures. Stamford, Conn.:
American Cyanamid Co., 1949.


(Received May 14, 1956) 





Journal of Experimental Psychology
Vol. 53, 1957


PROACTIVE INTERFERENCE AND FACILITATION AS A 
FUNCTION OF AMOUNT OF TRAINING AND STRESS¹


DAVID S. PALERMO²


Iowa Child Welfare Research Station, State University of Iowa


In a recent study, Castaneda (2) 
found a significant interaction be- 
tween stress and type of S-R pair 
learned in a second task. One type 
of S-R pair was identical to those 
learned in a previous task, and the 
other type involved re-pairing of 
stimuli and responses learned in the 
first task, a procedure suggested by 
Duncan (4) and Porter and Duncan 
(6). The interaction was based on a 
tendency for performance of the stress 
group to be poorer than that of the 
nonstress group where re-pairing was 
required, but better where it was not. 
Assuming that the incorrect response 
tendency was dominant where re- 
pairing was required, while the correct 


response was dominant where it was 
not, and that the pacing condition 
(stress) served to increase the drive 
level, these results were interpreted 
as consistent with Spence’s deriva- 


tions (7) from Hull (5). Using a 
similar procedure, Castaneda and 
Palermo (3) obtained results sug- 
gesting that the magnitude of im- 
pairment or facilitation of perform- 
ance as a result of an increase in 
drive varies as a positive function 
of the strength of the dominant habit. 

The present experiment employed
a modification of the procedure re-
ported elsewhere by Castaneda (1)
to investigate the effects of stress on
performance of a task in which the
dominant habit associated with a
given stimulus was either a correct
habit which was either relatively
weak or strong, or an incorrect habit
which was either relatively weak or
strong.

¹ The present paper is a portion of a dis-
sertation submitted to the Graduate College
of the State University of Iowa in partial ful-
fillment of the requirements for the Ph.D.
degree in the Iowa Child Welfare Research
Station. The writer is indebted to Dr. Alfred
Castaneda who directed the dissertation.

² Now at Southern Illinois University.


METHOD


Subjects and design.—A total of 78 fourth- 
grade Ss participated in the present experiment.* 
However, following the procedure reported by 
Castaneda (2), 27 of these Ss were eliminated 
from the analysis because of failure to attain a 
criterion of 16 correct responses in the first task. 
Of the remaining 51 Ss, 25 were assigned to the 
paced (stress) group and 26 to the nonpaced 
(nonstress) group. 

Apparatus and procedure.—The apparatus, 
identical to that reported by Castaneda (2), 
consisted of a box with a response panel on which
were located two push buttons and a green
jeweled reflector. Above the panel the four 
colored stimulus lights were presented through 
an aperture. If the correct button was selected, 
the stimulus light went off and the green light 
went on. Incorrect responses affected neither 
the stimulus light nor the light on the response 
panel. 

The procedure was the same as reported by 
Castaneda (2) except that in the first task 20 
trials were given such that two of the lights 
appeared eight times apiece, two appeared twice 
apiece, and no single light appeared twice in 
succession. Each light remained on until the 
correct response was made. Thus, each S 
received two reinforced presentations to two 
lights and eight to the other two. In the second 
task each light was presented 10 times, and 
again, no single light appeared twice in succes- 
sion. However, only one response was per- 
mitted to each light. 
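
The presentation constraints just described (two lights shown eight times, two shown twice, no light twice in succession) can be illustrated with a short sketch. The light labels, the function name `make_sequence`, and the restart-on-dead-end sampling scheme are assumptions made for illustration; the report does not say how the actual orders were constructed, and this scheme does not sample all valid orders with equal probability.

```python
import random
from collections import Counter

def make_sequence(counts, rng):
    """Return a random order with the given per-light counts and no immediate repeats.

    Assumes the counts are feasible; infeasible counts would loop forever.
    """
    total = sum(counts.values())
    while True:  # restart whenever a draw runs into a dead end
        remaining = dict(counts)
        seq = []
        while len(seq) < total:
            # Lights still available that differ from the one just shown.
            options = [s for s, c in remaining.items()
                       if c > 0 and (not seq or s != seq[-1])]
            if not options:
                break
            weights = [remaining[s] for s in options]
            pick = rng.choices(options, weights=weights)[0]
            seq.append(pick)
            remaining[pick] -= 1
        if len(seq) == total:
            return seq

# Hypothetical labels for the four colored lights in the first task.
first_task_counts = {"A": 8, "B": 8, "C": 2, "D": 2}
first_task = make_sequence(first_task_counts, random.Random(0))
```

The same function would serve for the second task by passing a count of 10 for each of the four lights.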

*Gratitude is expressed to Mr. David K.
Stewart, Elementary School Curriculum Co-
ordinator, Iowa City, Iowa, for his assistance
and cooperation in arranging for facilities and
subjects.

In both tasks two of the lights were turned
out by one button and two were turned out by
the other button. Of the four lights used in the
first task, one which had appeared eight times 
and one which appeared twice were selected for 
re-pairing in the second task. 

The instructions for the first task were 
designed to indicate to S that he was to learn 
which button turned off each light and that if 
an error was made, correction would be allowed. 
The second task followed immediately, and 
neither group was given any information re- 
garding the fact that two of the pairs required 
re-pairing. The nonpaced group was told to 
continue but that now correction would not be 
allowed when an error was made. The paced
group was given the same instructions, and in
addition, instructions designed to indicate that
speed of response was highly important. The
E demonstrated that the light would be on for
1 sec. and told S that a response had to be made
before the light went off or it would be counted
as an error. However, during the task the
interval timer was set to give a 2-sec. exposure
of the light rather than the 1-sec. exposure
demonstrated. If a response was not made 
during this interval, the same light was re- 
presented until a response could be recorded for 
that trial. Errors due to failure to respond 
within the interval were not included in the 
data. No S had to have any single light re- 
presented more than once. Failure to respond 
within the interval occurred among 14 Ss for a 
total of not more than three trials for any one 
S. Of the total number, 9 responses were to 
the lights requiring no re-pairing, and 13 were 
to the lights requiring re-pairing. 


RESULTS AND DISCUSSION


Table 1 presents the mean number 
of errors made by each group to the 
four separate stimuli in the first task. 


TABLE 1

Errors to the Four Stimuli in the First Task

Pairing in second task: Not Re-Paired (Nonpaced, Paced); Re-Paired (Nonpaced, Paced)

Level of Habit Strength
Weak    3/53) 142 “391.731 71 | 801 3 75
Strong  2|.33| .40|.57|.38|.56|.40|.49

[Figure 1: mean number of errors plotted against relative strength of dominant habit, weak (2 trials) and strong (8 trials).]

Fig. 1. Mean number of errors for stress
and nonstress groups on the stimuli which
received different amounts of training and were
re-paired or not re-paired.


The group which was paced in the 
second task made more errors on all 
four stimuli; however, the differences 
are significant (P < .01) on only
those stimuli which were not re-paired 
in the second task. Since it was 
predicted that the stress group would 
make fewer errors than the nonstress 
group on these two stimuli in the 
second task, these results would 
appear to work against the hypothesis. 

Figure 1 shows the mean number of 
errors made in the second task to the 
various stimuli by the stress and the 
nonstress groups. It can be noted 
that, for both groups, more errors 
occurred on the two stimuli for which 
the response learned in the first task 
was no longer correct than to the two 
stimuli for which the previously 
learned response was correct. The 
effectiveness of the amount of training 
is revealed in the tendency for errors 
to be greater on the stimulus which 
had been presented eight times in 
the first task and for which the 
previously learned response was no 







longer correct, and less on the stimulus 
which had been presented eight times 
but for which the previously learned 
response was still correct. Also, the 
effects of stress appear to depend on 
the type of stimulus involved, i.e., 
the stress group, in comparison to 
the nonstress group, made more errors 
on those stimuli for which the previ- 
ously learned response was no longer 
correct, but fewer errors on the stimuli 
for which the same response was 
correct. Only in the case of the 
stimuli for which the previously 
learned response was no longer cor- 
rect, however, does this latter effect 
appear to depend on the amount of 
training. 

A summary of an analysis of variance
based on these data, presented in 
Table 2, indicates that the main 
effects of stress and amount of training 
are not significant, while the effects of 
type of stimulus are highly significant. 
The interaction of stress and type of 
stimulus is significant, as well as the 
interaction of type of stimulus and 
training level. The triple interaction, 
indicating that the magnitude of the 
effects of stress and type of stimulus 
varied with amount of training, was 
significant at only the .10 level. 
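
The F ratios reported in Table 2 can be checked directly from the mean squares. The sketch below is only an arithmetic check: the pairing of each effect with an error term is an assumption inferred from which ratio reproduces the reported F, and the dictionary keys are labels chosen here for illustration.

```python
# Mean squares as read from Table 2.
ms = {
    "St": 176.96, "St x S": 31.17, "Error1 (w)": 5.69,
    "T x S": 5.69, "Error2 (w)": 2.66,
    "St x T": 165.96, "St x S x T": 8.49, "Error3 (w)": 2.66,
}

# effect -> (assumed error term, F reported in Table 2)
pairings = {
    "St": ("Error1 (w)", 31.10),
    "St x S": ("Error1 (w)", 5.48),
    "T x S": ("Error2 (w)", 2.14),
    "St x T": ("Error3 (w)", 62.39),
    "St x S x T": ("Error3 (w)", 3.19),
}

# F is the effect mean square divided by its error mean square.
f_ratios = {effect: ms[effect] / ms[err] for effect, (err, _) in pairings.items()}

for effect, (err, reported) in pairings.items():
    print(f"{effect:12s} F = {f_ratios[effect]:6.2f} (reported {reported:.2f}, tested against {err})")
```

Under these assumed pairings every computed ratio agrees with the tabled F to two decimals.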

TABLE 2

Summary of Analysis of Variance of Errors Made in the Second Task

Source            MS       F       P
Stress (S)       1.44
Error (b)        7.55
Stimuli (St)   176.96   31.10   <.001
Training (T)      .08
St × T         165.96   62.39   <.001
St × S          31.17    5.48   <.025
T × S            5.69    2.14   <.20
St × S × T       8.49    3.19   <.10
Error₁ (w)       5.69
Error₂ (w)       2.66
Error₃ (w)       2.66

Support for the assumptions that the
four stimuli in the second task had
associated with them either a correct 
habit which was relatively weak or 
strong or an incorrect habit which was 
relatively weak or strong, is evident in 
the significant main effects of stimuli 
and the significant interaction between 
stimuli and training. Considering the 
stimuli which required learning of two 
responses successively, the present find- 
ings are in line with those reported by 
Porter and Duncan (6) and Castaneda 
(1) that the amount of interference, 
under such conditions, varies as a posi- 
tive function of the associative strength 
of the first learned response. Con- 
sidering the stimuli for which the same 
response was correct in both tasks, the 
present results indicate that the amount 
of facilitation in the second task varied 
as a positive function of the associative 
strength, i.e., the amount of training in
the first task. The significant inter- 
action between stress and stimuli is 
consistent with the two previous studies 
(2, 3) and in accord with expectations 
from the assumptions upon which the 
present study was based. It should be 
noted here that despite the fact that the 
stress group made significantly more 
errors on the unchanged stimuli than the 
nonstress group in the first task, the 
effects of stress tended to facilitate 
performance in the second task on these 
stimuli. 

The expectation that the effects of 
stress would be a positive function of 
habit strength, based on the assumption 
that habit and drive combine multipli- 
catively, was substantiated in the triple 
interaction at only the .10 level. On 
the basis of Fig. 1, the effect appeared 
more pronounced on the re-paired stimuli 
than on the stimuli which were not 
re-paired. It may be noted, however, 
that a mean of less than 2.0 errors was 
made by both groups on the one stimulus 
presented eight times in the first task 
and not re-paired, hence leaving little 
room for further improvement. 


SUMMARY 


The present experiment was concerned with 
the relation of drive strength to performance 







in a complex learning situation. The specific 
theoretical implication investigated was that 
the performance of stress (high drive) Ss would 
be poorer than that of nonstress (low drive) Ss 
in a task which involved stimuli for which a 
response previously learned was no longer cor- 
rect, but that the performance of the stress Ss 
would be better on the same task involving 
stimuli for which a response previously learned 
was still correct. Further, the study investi- 
gated the implication that the greater the 
magnitude of the difference between the strength 
of the correct and the incorrect habit the greater 
would be the differences between the perform- 
ance of the high and the low drive groups. 

All 51 Ss were given 20 trials on the first task 
in which four S-R pairings had to be learned. 
Two of the stimuli were presented more times 
than the other two. In the second task, 25 Ss 
worked under a stress condition and 26 Ss 
worked under the same conditions as in the first 
task. One of the stimuli which received a 
greater amount of practice and one of the 
stimuli which received a lesser amount of 
practice were re-paired in the second task, and 
the other two stimuli were not re-paired. 

The results were in agreement with the theo- 
retical expectation that the stress Ss would 
make more errors than the nonstress Ss on the 
re-paired stimuli and fewer errors on the stimuli 
which were not re-paired. A triple interaction, 
significant at only the .10 level, was obtained
indicating that the magnitude of the effects of
stress and type of stimulus varies with the
strength of training of the stimulus.


REFERENCES 


1. Castaneda, A. Proactive interference as a
function of amount of original training.
Proc. Iowa Acad. Sci., 1955, 62, 411-414.

2. Castaneda, A. Effects of stress on complex
learning and performance. J. exp. Psy-
chol., 1956, 52, 9-12.

3. Castaneda, A., & Palermo, D. S. Psy-
chomotor performance as a function of
amount of training and stress. J. exp.
Psychol., 1955, 50, 175-179.

4. Duncan, C. P. Transfer in motor learning
as a function of degree of first-task
learning and inter-task similarity. J.
exp. Psychol., 1953, 45, 1-11.

5. Hull, C. L. Principles of behavior. New
York: D. Appleton-Century, 1943.

6. Porter, L. W., & Duncan, C. P. Negative
transfer in verbal learning. J. exp. Psy-
chol., 1953, 46, 61-64.

7. Spence, K. W. Current interpretations of
learning data and some recent develop-
ments in stimulus-response theory. In
Kentucky Symposium: Learning Theory,
Personality Theory and Clinical Research.
New York: Wiley, 1954.


(Received April 16, 1956) 





Journal of Experimental Psychology
Vol. 53, 1957


SERIAL EFFECTS IN AUDITORY THRESHOLD 
JUDGMENTS¹


WILLIAM J. McGILL 


Columbia University 


Recent studies have shown that 
successive threshold judgments in 
vision are not independent (2, 10, 11). 
The present paper extends this finding 
to auditory judgments and shows that 
S’s guessing habits are responsible for 
part of the nonindependence. The 
data are based on sequences of tones 
heard against a background of noise. 
The threshold involved is the thresh- 
old for masking. The judgment re- 
quired was an “absolute” identifi- 
cation of each tone. 

Serial correlation in successive “yes-
no” judgments suggests autocorrela-
tion in the underlying threshold
process. Similar correlations are
found in continuous threshold records
made with the Békésy audiometer (1).


Absolute judgments are more com- 
plicated because they afford several 
different opportunities for S to be 
wrong. Nonindependence can creep 
into such judgments in two ways. 
First, patterning can appear in a 
sequence of judgments considered 
simply as correct or incorrect. Sec- 
ondly, nonindependence can limit the 
choice of alternatives among the in- 
correct judgments. The first type 
of nonindependence is similar to 
autocorrelation in continuous thresh- 
old recording. The second type of 
nonindependence appears to depend 
on S’s guessing habits. The present 
paper is concerned with showing that 
the nonindependence in auditory ab-
solute judgments is principally of the
second type and is, in fact, due to
S’s guessing habits.

¹ The research reported in this article was
supported jointly by the Army, the Navy, and
the Air Force under contract with the Massa-
chusetts Institute of Technology.

Threshold stimuli are indistinct, 
and S often reports that he is “gues- 
sing.” The difficulty with such re- 
ports is that S is seldom a good judge 
of how well he is doing. A more 
reliable indication of guessing, if it 
occurs, can be extracted from the 
judgments themselves (9). For ex- 
ample, one can say that S is guessing 
if his judgments are mutually cor- 
related and independent of stimuli. 
When this criterion is met the con- 
straints on successive judgments must 
be internal to S. “Guessing” then 
becomes a label for effects attributable 
to a broad class of determiners in- 
cluding set, previous training, method 
of responding, and so on. The es- 
sential feature of this class of con- 
straints is that its effects on judgments 
are the same no matter what stimulus 
has been presented to S. In short, 
guessing habits act independently of 
stimuli. It follows that guessing can 
be measured by measuring the pre- 
dictability of successive judgments 
provided that S works far enough 
below threshold to insure that judg- 
ments and stimuli are independent. 
In the present experiment this re-
striction requires that the level of the
noise is high enough to mask the 
stimulus tones. In the limiting case 
the tones are omitted altogether. 

The present procedure is to vary 
the intensity of the stimulus tones and 
observe the predictability of succes- 
sive judgments, using the guessing 
series (stimuli omitted) as a baseline. 
To the extent that the predictability
of successive judgments, when the
tones are more or less audible, re-
sembles the predictability in the
guessing series, it may be inferred that
S is guessing in the threshold series.
The results support this inference.


METHOD


The stimuli were four brief tones at constant 
intensity but differing in frequency. The 
frequencies were 890, 925, 970, 1005 cps. A 
series of these tones was presented one at a time 
in random order against a noise background. 
The series was controlled by a tape programmer. 
Four Hewlett-Packard 200-C audio-oscillators 
were connected through the programmer to a 
narrow bandpass filter which went to a mixing 
circuit where the signals from the oscillators 
were added to random noise generated by a 6D-4 
gas tube. The program sampled a tone from 
one of the oscillators every 2.8 sec. Thus S
heard one of four randomly chosen tones every 
2.8 sec. The tone lasted for .5 sec. and was 
superimposed on a steady background of noise. 
While the tone sounded, a light flashed directly 
in front of S to signal the presentation of the 
stimulus. Each oscillator was set to generate 
one of the four tones. The larger gap between 
the middle tones helped to equalize the difficulty 
of the judgments. Listening was binaural. 
The earphones were type PDR-8. Measure- 
ments of signal and noise intensity were made at 
the input to the earphones. Stimulus intensity 
is reported as the ratio of signal power to the 
noise power per cycle or spectrum level of the 
noise. The intensity of the tones was fixed, 
and signal-to-noise ratio was changed by varying 
the noise. 
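
As a small illustration of the intensity measure described above, the sketch below converts a ratio of signal power to noise power per cycle into decibels. The function name `snr_db` and the example powers are hypothetical; only the definition of the ratio comes from the text, and the conversion is the ordinary decibel formula for a power ratio.

```python
import math

def snr_db(signal_power, noise_power_per_cycle):
    """Signal-to-noise ratio in db: 10 * log10 of a power ratio."""
    return 10.0 * math.log10(signal_power / noise_power_per_cycle)

# Hypothetical powers in arbitrary but matching units.
example_db = snr_db(100.0, 1.0)

# With the tone fixed, raising the noise lowers the ratio, which is how
# the signal-to-noise ratio was varied in the experiment.
louder_noise_db = snr_db(100.0, 10.0)
```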

The S was given preliminary training in 
which he learned to make “absolute” judgments 
(identifications) of the four tones by pressing 
four response keys. He was instructed to guess 
when in doubt. During training S was corrected 
freely, but no information about the accuracy 
of his judgments was given during experimental 
sessions. Before each session S was required to 
pass a pretest consisting of 100 tones. The 
passing criterion was 90% correct. In the 
pretest no masking noise was added. However, 
the narrow gap between adjacent frequencies 
produced confusions even in the absence of 
masking noise. Frequent retraining was neces- 
sary in order to maintain the pretest standard 
of accuracy. 

Each S was tested at 15, 22, 25, 30, 35 db.
Each test consisted of a series of 250 stimuli.
In addition, a guessing series was obtained on
each S. This series resembled the others except
that the stimuli were disconnected from the
input to the earphones. The tones appeared 
to be masked completely by noise, and S indi- 
cated his guess by pressing one of the response 
keys whenever the stimulus light flashed. The 
S was not informed that there were no stimuli. 

Tests were run in a modified counterbalanced 
order. The series at 15 db. and the guessing 
series were always done last. The other four 
intensities were arranged in a latin square with 
Ss. 

Four M.I.T. undergraduates served as Ss.
Each S provided 1500 judgments. 


RESULTS 


[Figure 1: proportion correct plotted against signal-to-noise ratio (db).]

Fig. 1. Psychometric function for absolute
judgments of brief tones superimposed on back-
ground of masking noise. Data are averaged
over four tones: 890, 925, 970, 1005 cps.

Figure 1 shows the combined psy-
chometric function for intensity of
the tones above the masking noise.
The curve in Fig. 1 is the average of
the curves determined at 890, 925,
970, and 1005 cps. Each point is
based on 250 judgments by each of
four Ss. The function in Fig. 1 is
pushed down or compressed from
above by the difficulty level of the
absolute judgments, since the pretest
criterion was barely met in most
sessions. The curve is also com-
pressed from below by the probability
of chance success—which in this case
is .25. These compressions make the
choice of a unique threshold point
somewhat arbitrary. The usual pro-
cedure is to adjust the observed
proportions to allow for chance suc-
cess. The proportion corresponding
to threshold is then computed from:

$$\frac{k+1}{2k},$$

where k is the number of forced choice
alternatives. In Fig. 1 this procedure
fixes the threshold at .63, or approxi-
mately 24 db. which is considerably
higher than the figure of 18 db. found
by the method of adjustment (6).
This seems to reflect the difficulty of
identifying four tones instead of one.
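
The chance-corrected threshold can be worked through numerically. The sketch below assumes the customary correction for chance success, in which the observed proportion is rescaled so that the chance level 1/k maps to zero, and it takes threshold as the observed proportion whose corrected value is .50. The function names are hypothetical, and the text does not spell out the correction formula itself.

```python
import math

def corrected_proportion(p, k):
    """Observed proportion p rescaled for chance success with k alternatives."""
    chance = 1.0 / k
    return (p - chance) / (1.0 - chance)

def threshold_proportion(k):
    """Observed proportion whose chance-corrected value is .50: (k + 1) / (2k)."""
    return (k + 1) / (2 * k)

k = 4                                  # four tones, so chance success is .25
p_threshold = threshold_proportion(k)  # 5/8, reported as .63 in the text
p_check = corrected_proportion(p_threshold, k)
```

With four alternatives the expression gives 5/8, which agrees with the rounded value of .63 in the text.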


The chance correction, however, is 
highly questionable with the present 
data because it assumes that successive 
judgments are independent, and it will 
be demonstrated that this is not the case. 
In fact, the threshold level is so high as 
to suggest that some odd things may 
occur in absolute judgments. The S is 
forced to do a lot of guessing, and this 
may somehow affect his sensitivity. In 
any case, the psychometric function is 
quite fuzzy because successive judgments 
are not independent. A better picture 
of what is going on is achieved by intro- 
ducing the idea of response predicta- 
bility. Wherever the four tones are cor- 
related with the responses made by S, 
the stimuli may be said to predict the 
responses up to the extent of their 
influence. Similar predictions are pos- 
sible from S’s previous responses. The 
total predictability of S’s current judg- 
ment may be expected to involve 
elements of both classes of predictors. 

The principal results trace shifts in 
predictability from the stimulus to S’s
last previous response (hereafter called
the “presponse”) as the level of the
masking noise and, consequently, the 
difficulty of the judgment varies in the 
region of the threshold. 

In order to measure the effects of two 
predictors simultaneously, the data are 
treated by uncertainty analysis as de- 
scribed by McGill (8) and Garner and 
McGill (5). This method was also used 




by Garner to study presponse effects in 
loudness judgments (4). In the present 
paper the terms “information” and 
“uncertainty” are used interchangeably. 
These two terms sometimes describe 
different aspects of information measures 
(5), but the differences are unimportant 
here. 

In the present application of uncer-
tainty analysis, the data are arranged in
a three-dimensional contingency table.
Each response is categorized according
to S’s identification of it, the stimulus
actually presented, and S’s presponse.
Let $w_i$ represent the ith category of
presponses; $x_j$, the jth category of
stimuli; and $y_k$, the kth category of S’s
identifications. Then $n_{ijk}$ is the number
of responses in the cell of the contingency
table determined by the intersection of
categories i, j, and k. Marginal sums
are designated by replacing indexes with
dots. For example, $n_{\cdot jk}$ is the number
of times S puts stimulus j into response
category k. The total uncertainty (in-
formation) in S’s responses is measured
as follows:

$$U(y) = -\sum_{k} \left[\frac{n_{\cdot\cdot k}}{n}\right] \log_2 \left[\frac{n_{\cdot\cdot k}}{n}\right]. \tag{1}$$

In Equation 1, n is the total number of
responses. It is easily shown that

$$U(y) = U_x(y) + U(y:x), \tag{2}$$

where $U_x(y)$ is the uncertainty remaining
in S’s responses when the stimulus is
known, and U(y:x) is the uncertainty
in S’s responses that is predicted by the
stimulus. U(y:x) is called transmitted
information or contingent uncertainty
(5) and is similar to the variance of
points on a regression line in metric data.
The uncertainty contingent on stimuli,
i.e., the last term in Equation 2, is
computed as shown in Equation 3:

$$U(y:x) = -\sum_{j,k} \left[\frac{n_{\cdot jk}}{n}\right] \log_2 \left[\frac{n_{\cdot j\cdot}\, n_{\cdot\cdot k}}{n\, n_{\cdot jk}}\right]. \tag{3}$$


The last step in the analysis is to
extract the presponse component from
the error term in Equation 2. This is
done as follows:

$$U_x(y) = U_{wx}(y) + U_x(y:w). \tag{4}$$

The new error term is $U_{wx}(y)$. The
presponse component is the second term
on the right of Equation 4. The
formula is given in Equation 5:

$$U_x(y:w) = -\sum_{i,j,k} \left[\frac{n_{ijk}}{n}\right] \log_2 \left[\frac{n_{ij\cdot}\, n_{\cdot jk}}{n_{ijk}\, n_{\cdot j\cdot}}\right]. \tag{5}$$

The quantity in Equation 5 is the 
uncertainty contingent on previous re- 
sponses. It is determined separately for 
each stimulus category and then aver- 
aged. The averaging essentially partials 
out the stimulus variable and makes 
Equation 5 a close relative of partial 
correlation. 
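
The partition described above can be sketched in a few lines. The function below takes a three-way table of counts indexed by (presponse, stimulus, response) and returns the total uncertainty, the stimulus component, the presponse component with stimuli held constant, and the remaining error term, all in bits. The function name, the dictionary layout, and the toy counts are assumptions for illustration; none of the numbers come from the experiment.

```python
import math
from collections import defaultdict

def uncertainty_components(counts):
    """counts maps (i, j, k) -> n_ijk with i = presponse, j = stimulus, k = response.

    Returns (U(y), U(y:x), U_x(y:w), U_wx(y)) in bits.
    """
    n = sum(counts.values())
    n_k = defaultdict(int)   # responses summed over presponses and stimuli
    n_j = defaultdict(int)   # stimulus totals
    n_jk = defaultdict(int)  # stimulus-by-response totals
    n_ij = defaultdict(int)  # presponse-by-stimulus totals
    for (i, j, k), c in counts.items():
        n_k[k] += c
        n_j[j] += c
        n_jk[(j, k)] += c
        n_ij[(i, j)] += c

    # Total response uncertainty.
    u_y = -sum((c / n) * math.log2(c / n) for c in n_k.values() if c > 0)
    # Uncertainty contingent on stimuli.
    u_yx = -sum((c / n) * math.log2(n_j[j] * n_k[k] / (n * c))
                for (j, k), c in n_jk.items() if c > 0)
    # Uncertainty contingent on presponses, stimuli held constant.
    u_x_yw = -sum((c / n) * math.log2(n_ij[(i, j)] * n_jk[(j, k)] / (c * n_j[j]))
                  for (i, j, k), c in counts.items() if c > 0)
    # Whatever is left after both predictors are taken out.
    u_wx_y = u_y - u_yx - u_x_yw
    return u_y, u_yx, u_x_yw, u_wx_y

# Toy two-category case: S ignores the stimulus and never repeats his last response.
toy = {(i, j, 1 - i): 25 for i in (0, 1) for j in (0, 1)}
u_y, u_yx, u_x_yw, u_wx_y = uncertainty_components(toy)
```

In the toy table the stimulus predicts nothing and the presponse predicts everything, so the whole bit of response uncertainty is assigned to the presponse component.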


The response uncertainty predicted 
by the four stimulus tones (Equation 
3) is presented in the column headed 
“Stimulus Component” in Table 1, 
which shows how the predictability of 
S’s judgments changes with signal-to- 
noise ratio. All data are measured 
in bits and averaged over the four Ss. 

TABLE 1

Changes in Stimulus Component of
Uncertainty in S’s Judgments
Produced by Changing Intensity
of Stimulus Tones

Relative Intensity (db.)   Stimulus Component   Total Uncertainty
Guessing                                         1.97
                                                 1.97
                                                 2.01
                                                 1.97
                                                 1.99
                                                 2.01

[Figure 2: uncertainty in bits plotted against signal-to-noise ratio (db); curves labeled “Predicted from Stimuli and Presponses” and “Predicted from Stimuli Alone.”]

Fig. 2. Uncertainty in S’s judgments
predicted jointly from stimulus tones and S’s
last previous response.

Response uncertainty (i.e., total
uncertainty determined from Equa-
tion 1) is stable at approximately two
bits at each stimulus intensity and in
the guessing series. This means that
Ss used the four frequency categories
equally often, whether they heard the 
tones or not. Two measures of total 
uncertainty in. Table 1 exceed two 
bits, indicating an occasional failure 
to respond. Failure to respond was 
treated as a fifth response category. 
As suggested previously, S’s judg- 
ments are predictable from stimuli 
and presponses jointly. Curves repre-
senting the joint prediction and the
prediction based on stimuli alone are 
compared in Fig. 2. The joint pre- 
diction in Fig. 2 is the nonmetric 
analogue of a multiple correlation 
derived from the three-dimensional 
data matrix. The uncertainty com- 
ponent predicted from stimuli is the 
one just presented in Table 1. It is 
based on the two-dimensional matrix 
relating stimuli and S’s responses. 
This component increases with in- 
tensity in about the same way as the 
proportion of stimuli correctly identi- 
fied. Contingent uncertainty, how- 
ever, is not compressed from below 
by chance successes. When S 
guesses, the stimulus component drops 
to zero because stimuli and responses 
are independent. Actually this is not 
strictly true, since it turns out that 





SERIAL EFFECTS IN AUDITORY JUDGMENTS 


contingent uncertainty is biased away 
from zero even when stimuli and re- 
sponses are independent. The bias, 


however, has been shown (7) to be 
approximately equal to 


(j — Dk — 1) 
1.3863n =” 





where j is the number of stimulus 
categories, k is the number of response 
categories, and nm is the number of 
responses. In the present data the 
bias amounts to .026 bits. This is 
roughly the magnitude of the stimulus 
component shown for the guessing 
series in Table 1. In the guessing 
series a stimulus tape was placed in 
the programmer each time, and S 
responded in the usual way. How- 
ever, the programmer was discon- 
nected from S’s earphone. Conse- 
quently, any information predicted 
from stimuli is due either to good luck 
or to ESP, neither of which seems to 
have occurred. 
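
The size of the bias is easy to reproduce. The sketch below evaluates the approximation with four stimulus categories and four response categories. Taking n = 250, the length of one series for one S, is an assumption made here because it reproduces the reported figure; the text does not state which n was used. The function name is hypothetical.

```python
def contingent_uncertainty_bias(j, k, n):
    """Approximate bias (bits) of contingent uncertainty under independence.

    The constant 1.3863 is 2 * ln 2.
    """
    return (j - 1) * (k - 1) / (1.3863 * n)

# Four tones, four response keys, and an assumed 250 responses per series.
bias = contingent_uncertainty_bias(j=4, k=4, n=250)
```

Rounded to three decimals this gives the .026 bits cited in the text.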

The component of response uncer- 
tainty predicted from presponses is 
measured with stimuli held constant. 
As previously noted, this uncertainty 
component is determined separately 
for each stimulus and then averaged. 


TABLE 2

Changes in Presponse Component of
Uncertainty in S’s Judgments
Produced by Changing Intensity
of Stimulus Tones

Relative Intensity (db.)   Size of Presponse Component
                           Obtained | Maximum
Guessing

[Figure 3: presponse uncertainty plotted against signal-to-noise ratio (db).]

Fig. 3. Uncertainty in S’s judgments
predicted from S’s last previous response, with
stimulus held constant. Presponse influences
on judgments are presented separately for odd-
and even-numbered responses.


Presponse components obtained at 
each stimulus intensity are shown in 
Table 2. The presponse effect is 
largest in the guessing series and de- 
clines slowly up to 22 db. There is 
a sharper drop between 22 and 25 db., 
after which the presponse effect is 
stable. 

Figure 3 shows the presponse com- 
ponent at each level of stimulus 
intensity. Separate curves are pre- 
sented for the odd-numbered and 
even-numbered responses. This pro- 
cedure is followed to prevent counting 
the same response twice—once as a 
presponse and again as a response. 
In Fig. 3 the odd-numbered responses 
are presponses for the function based 
on even-numbered responses and vice 
versa. Significance level is obtained 
from the likelihood ratio test for 
association between presponses and 
responses with stimuli held constant. 
The critical value of chi square at 
the 5% level is used to calculate a 
presponse component significantly dif- 
ferent from zero, and a dashed line is 
drawn across Fig. 3 at this point. 
The presponse component of response 
uncertainty drops below the critical 
value at 25 db. and remains below 
thereafter. Additional calculations 
recommended by Miller and Madow 
(7) reveal that the observed presponse 
component above 25 db. is about
equivalent to the statistical bias ex-
pected when successive responses are
independent. The difference between
the sizes of the presponse components 
in Table 2 and Fig. 3 is due to inflated 
statistical bias when the number of 
observations is halved. The differ- 
ence between the two curves in Fig. 
3, however, is not easy to explain. 
It may reflect a periodicity (of the 
autocorrelational type) in the data. 
Table 2 also presents the maximum 
size and the relative size of the 
presponse component. The pre- 
sponse component is extracted from 
the response uncertainty when the 
stimulus is known, i.e., from S’s 
identification errors. Equation 4 
shows that the latter determine the 
maximum value of the presponse 
component for any given stimulus 
intensity. The maximum, U_x(y), is
computed by subtracting the stimulus 
component from the total uncertainty 
in Table 1. This difference is given 
in Table 2. The relative size of the 


presponse component is determined 
from the ratio of the obtained size 


to the maximum size. This ratio is 
practically constant. There is nothing
in the analysis that forces this
result. The picture appears to be the 
following. In the two-dimensional 
matrix relating stimuli and responses, 
the number of S’s responses that fall 
in the correct-response cell increases 
with S/N ratio. But the distribution 
of S’s incorrect responses remains 
stable, so that a constant proportion 
of these incorrect responses can be 
predicted from presponses. Hence, 
the patterning of incorrect responses 
seems to be determined by presponses 
and reflects S’s nonrandom guessing 
habits. The type of patterning is 
not revealed by the analysis, but the 
presponse-response matrices disclose 
that S tends to avoid repeating his 
last response. 
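
The quantities discussed above (stimulus component, maximum presponse component, obtained presponse component, and their ratio) can be illustrated with a short computation. This is a sketch that follows the verbal description only, assuming uncertainties in bits estimated from raw frequencies; the function names and the toy data are hypothetical and are not the data of Tables 1 and 2.

```python
import math
from collections import Counter

def uncertainty(counter):
    """Uncertainty (bits) of the distribution given by a Counter of frequencies."""
    n = sum(counter.values())
    return -sum(c / n * math.log2(c / n) for c in counter.values())

def presponse_analysis(trials):
    """trials: iterable of (stimulus, presponse, response) triples."""
    trials = list(trials)
    u_y = uncertainty(Counter(y for _, _, y in trials))
    u_x = uncertainty(Counter(x for x, _, _ in trials))
    u_xy = uncertainty(Counter((x, y) for x, _, y in trials))
    u_xv = uncertainty(Counter((x, v) for x, v, _ in trials))
    u_xvy = uncertainty(Counter(trials))

    maximum = u_xy - u_x                  # response uncertainty with stimulus known
    stimulus = u_y - maximum              # stimulus component
    presponse = maximum - (u_xvy - u_xv)  # part of the maximum predicted by presponses
    relative = presponse / maximum if maximum > 1e-12 else None
    return {"stimulus": stimulus, "maximum": maximum,
            "presponse": presponse, "relative": relative}

# Toy case: S ignores the stimulus and never repeats his last response.
toy = [(x, v, 1 - v) for x in (0, 1) for v in (0, 1)]
print(presponse_analysis(toy))
```

In the toy case every error is predictable from the presponse, so the relative size is 1; in real data the ratio falls between 0 and 1.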




Discussion 


This experiment demonstrates that 
presponses have a marked effect on 
absolute judgments when stimulus in- 
tensity is at or slightly below threshold. 
Hence, successive judgments are not 
independent. Furthermore, the effect 
blends smoothly with S’s judgments in 
the guessing series. The combined 
effects appear to reflect a lawful process 
in absolute judgments. Consequently, 
it is worth while to try to identify the 
variables that affect S’s guessing habits. 

One variable appears to be the number 
of response alternatives, especially when 
S has been set by training or instructions 
to respond randomly. Experiments with
other guessing series, not reported here, 
indicate that presponse influences are 
small with two-category forced choice 
judgments. Garner (4) has shown that 
presponse influences change systemati- 
cally with the number of alternatives 
between 5 and 20 categories. A second 
variable is the time interval between 
successive judgments. In a check ex- 
periment at 25 db. on the Ss in this 
experiment, it was observed that the 
presponse component declined by small 
amounts but consistently as the inter- 
stimulus interval increased. Tests were 
made at intervals of 2.8, 5.6, 8, and 11 
sec. Day’s results (3) show that the 
most important presponse influences are 
found with time intervals shorter than 
2.1 sec. 

A third variable controlling presponse 
influences is the difficulty level of the 
absolute judgments. The variation of 
the presponse effect in the vicinity of the 
threshold demonstrated in this experi- 
ment illustrates this control. In another 
check experiment on Ss in this experi- 
ment, the difficulty level of the judgment 
was eased by increasing the highest 
frequency from 1005 to 1075 cps. Tests 
were run at 20, 25, 30, and 35 db. Information
transmitted from stimuli exceeded
the value reported in Table 1 at
each intensity. Presponse components 
were correspondingly lower at each 
intensity. 







In this study presponse influences were 
limited to a single prior response. In- 
fluences from earlier responses were thus 
relegated to the error term of the in- 
formation analysis. It is possible that 
genuine long-term influences exist and 
that their extent depends on the variables 
described above. Long-term presponse 
influences have been claimed, but an 
important methodological consideration 
has been overlooked. Suppose only the 
last previous response affects a judgment. 
This influence will be propagated spuri- 
ously over the entire series of judgments 
because, if response a affects response b,
and b affects c, then a will also appear to
affect c unless b is partialed out. Consequently
autocorrelational studies of
response sequences reveal little about 
the structure of long-term influences. 
In fact, autocorrelations may be very 
misleading since they often indicate 
much more complicated structures than 
actually exist. What is required is a 
study of long-term influences with inter- 
vening effects partialed out, but data of 
this kind are easier to conjecture than 
to obtain. 
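
The methodological point can be made concrete with a small calculation. The sketch below assumes a hypothetical four-category response process in which only the last response matters and S never repeats it; the transition probabilities are invented for illustration. It computes the apparent dependence of response c on response a two steps back, and the same dependence with the intervening response b partialed out.

```python
import math
from itertools import product

K = 4  # number of response categories (hypothetical)

# First-order process: S never repeats the last response and chooses
# equally among the other categories.
P = [[0.0 if i == j else 1.0 / (K - 1) for j in range(K)] for i in range(K)]
start = [1.0 / K] * K  # uniform start, stationary for this symmetric matrix

# Joint distribution of three successive responses a, b, c.
joint = {(a, b, c): start[a] * P[a][b] * P[b][c]
         for a, b, c in product(range(K), repeat=3)}

def marginal(dist, keep):
    """Sum the joint distribution down to the positions listed in keep."""
    out = {}
    for key, p in dist.items():
        sub = tuple(key[i] for i in keep)
        out[sub] = out.get(sub, 0.0) + p
    return out

p_a, p_b, p_c = marginal(joint, (0,)), marginal(joint, (1,)), marginal(joint, (2,))
p_ab, p_bc, p_ac = marginal(joint, (0, 1)), marginal(joint, (1, 2)), marginal(joint, (0, 2))

# Apparent lag-2 dependence: information shared by a and c, ignoring b.
lag2 = sum(p * math.log2(p / (p_a[(a,)] * p_c[(c,)]))
           for (a, c), p in p_ac.items() if p > 0)

# Same dependence with b partialed out (conditional on b).
partial = sum(p * math.log2(p * p_b[(b,)] / (p_ab[(a, b)] * p_bc[(b, c)]))
              for (a, b, c), p in joint.items() if p > 0)

print(round(lag2, 4), round(abs(partial), 4))
```

The lag-2 measure is positive even though a has no direct influence on c, while the partialed measure vanishes, which is the pattern the argument predicts for a purely first-order process.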

SUMMARY 


Four Ss learned to identify tones at 890, 925, 
970, 1005 cps. The Ss then judged sequences 
of 250 brief stimuli delivered in random order 
at constant intensity. The difficulty of the 
judgment was varied by varying the level of a 
background masking noise. The data were then 
analyzed to determine the joint effect of the 
stimulus and S’s guessing habits (called pre- 
sponse effects) in determining judgments. 
Results were as follows: 


1. A marked presponse effect was observed 
for signal-to-noise ratios lower than 22 db. This 
corresponds to judgments at or below the average 
value of the threshold. 

2. The presponse influence was greatest when 
no stimuli were presented. (The Ss were un- 
aware that there were no stimuli.) The 
presponse influence declined gradually as tones 
were introduced and made more intense. 




3. Information analysis showed that the 
frequency of S’s judgment errors was controlled 
by stimulus intensity, while the pattern of errors 
was governed by S’s guessing habits. 


REFERENCES 
1. Békésy, G. von. A new audiometer.
Acta Oto-Laryng., Stockh., 1947b, 35,
411-422.

2. Collier, G. Probability of response and
intertrial association as functions of
monocular and binocular stimulation.
J. exp. Psychol., 1954, 47, 75-83.

3. Day, W. Serial nonrandomness in auditory
thresholds as a function of interstimulus
interval. Amer. J. Psychol., 1956, 69,
387-394.

4. Garner, W. R. An informational analysis
of absolute judgments of loudness. J.
exp. Psychol., 1953, 46, 373-380.

5. Garner, W. R., & McGill, W. J. The
relation between information and variance
analyses. Psychometrika, 1956, 21,
219-228.

6. Hawkins, J. E., & Stevens, S. S. The
masking of pure tones and speech by
white noise. J. acoust. Soc. Amer., 1950,
22, 6-13.

7. Miller, G. A., & Madow, W. On the
maximum likelihood estimate of the
Shannon-Wiener measure of information.
Cambridge: USAF Cambridge Res. Center,
Tech. Rep. No. TR-54-75, 1954.

8. McGill, W. J. Multivariate information
transmission. Psychometrika, 1954, 19,
97-116.

9. Senders, V. L., & Sowards, A. Analysis
of response sequences in the setting of a
psychological experiment. Amer. J. Psychol.,
1952, 65, 358-374.

10. Verplanck, W. S., Collier, G. H., &
Cotton, J. W. Nonindependence of
successive responses in measurements of
the visual threshold. J. exp. Psychol.,
1952, 44, 273-282.

11. Verplanck, W. S., Cotton, J. W., &
Collier, G. H. Previous training as a
determinant of response dependency at
the threshold. J. exp. Psychol., 1953,
46, 10-14.


(Received March 21, 1956) 





Journal of Experimental Psychology
Vol. 53, No. 5, 1957


THE ABSTRACTION AND CONCEPTUALIZATION OF 
FORM, COLOR, AND NUMBER¹


JOACHIM F. WOHLWILL 


University of California 


The question of the relative facility 
with which concepts of form, color, 
and number are attained in different 
situations has been extensively in- 
vestigated by Heidbreder and her 
co-workers (8, 10, 11), and subse- 
quently by Grant and his associates 
(3, 4) using the Wisconsin Card 
Sorting Test (WCST). The results 
from these studies have shown, in 
general, that abstract forms are more 
readily conceptualized than colors or 
numbers, although with respect to the 
latter, the two series of studies have 
yielded somewhat contradictory 
results. 

Heidbreder (6) has maintained that 
such studies give evidence of a domi- 
nance-hierarchy of concepts, extend- 


ing from concepts of concrete objects 
to those of number, according to their 


degree of “thing-character.” The
nature and origin of this hierarchy 
remain, however, uncertain in a
number of respects. One of these 
concerns the place of perception in 
this hierarchy of concepts. Specifi- 
cally, is readiness for conceptuali- 
zation primarily a function of facility 
of perceptual abstraction of a given 
stimulus attribute, or are conceptual 
predispositions responsible for it 
which are further removed from the 
perceptual process? 

While Heidbreder’s experiments (8) 
actually show a decreasing influence 
of the hierarchy of concepts as the 
role of direct perception in a given 
task increases and therefore argue 


¹ The writer gratefully acknowledges the
generous advice and criticism offered by Dr.
Donald A. Riley.




against the former of the above 
alternatives, she nevertheless goes to 
some length in one of her more recent 
articles to explain that the order of 
dominance of concepts as advocated 
by her is primarily one of perceptual 
rather than conceptual processes (9, 
pp. 117-118). 

One way of exploring this problem 
further would be to compare the order 
of dominance of stimulus attributes in 
two contrasting tasks involving identi- 
cal stimulus materials—one of a 
problem-solving nature, in which hy- 
potheses based in part on S’s pre- 
dispositions might be displayed, the 
other of a perceptual nature, in which 
the response to be made is completely 
defined for S. 

A concept formation experiment 
closely similar to the WCST, but 
using probabilistic reinforcement of 
the different concepts, was selected 
for the first of these tasks. By using 
probabilistic reinforcement and thus 
preventing S from finding a con- 
sistently correct mode of response, it 
was hoped to maximize any tendencies 
on the part of S to respond to one 
rather than another of the various 
stimulus-cues. 

The second task, a card-sorting 
task in which speed of response to a 
predetermined stimulus dimension of 
the card was measured, provided for 
a comparison of these same cues in 
terms of their relative ease of ab- 
straction. As used in this paper, this 
term abstraction is taken to refer to 
that aspect of the discrimination 
process which involves the response 
to a given sensory dimension in the 







presence of variation of the stimuli 
in other dimensions. An operational 
definition for the measurement of 
speed of abstraction in this sense is 
presented later. 


METHOD
Part I


This part of the experiment concerns the 
comparison of concepts of form, color, and 
number in a probabilistic concept formation 
task. 

Stimuli.—Forty-eight response cards from 
the WCST were selected, together with the four 
WCST stimulus cards. These varied in form 
(triangle: T; star: S; cross: X; circle: C), color 
(red: r; blue: b; green: g; yellow: y) and 
number (one, two, three, four). Every response 
card matched any given stimulus card on no 
more than one dimension. 
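
The matching constraint on the response cards can be checked by enumeration. The sketch below assumes the full set of 4 × 4 × 4 possible WCST designs and the four stimulus cards named in the procedure (2gS, 4bC, 3yX, 1rT); the tuple encoding and the names are illustrative only.

```python
from itertools import product

NUMBERS = (1, 2, 3, 4)
COLORS = ("r", "b", "g", "y")
FORMS = ("T", "S", "X", "C")

# Stimulus cards as (number, color, form), in the order given in the procedure.
STIMULUS_CARDS = [(2, "g", "S"), (4, "b", "C"), (3, "y", "X"), (1, "r", "T")]

def n_matches(card, stimulus):
    """Number of dimensions on which a response card agrees with a stimulus card."""
    return sum(a == b for a, b in zip(card, stimulus))

# Keep designs that agree with every stimulus card on at most one dimension.
eligible = [card for card in product(NUMBERS, COLORS, FORMS)
            if all(n_matches(card, s) <= 1 for s in STIMULUS_CARDS)]

print(len(eligible))
```

Under these assumptions 24 distinct designs satisfy the constraint, so each sorting response is unambiguous as to form, color, or number. How the 48-card deck was assembled from such designs is not stated in the text.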

Procedure.—The S was seated in front of a 
sorting board with four compartments, behind 
which the four stimulus cards had been placed 
in the following order: 2gS, 4bC, 3yX, 1rT. He
was given the deck of 48 response cards, with 
the following instructions: 

“. . . You are to sort this deck of cards in
front of you one by one into the compartment 
underneath the stimulus card with which you 
think the card to be sorted belongs. After you 
have sorted each card, I will tell you whether 
you were right or wrong. Your purpose should 
be to get as many correct as you can. While 
this is not a timed task, you should not spend 
too much time on any one card, but rather try 
to sort at a fairly even rate. . . . Always wait
for my announcement of ‘right’ or ‘wrong’ before 
proceeding to sort the next card.” 

Conditions of reinforcement.—Reinforcement 
schedules were prepared in advance, such that 
for two-thirds of the cards in the deck one of the 
three stimulus dimensions would be the correct 
basis for sorting, while reinforcements on the 
remaining third of the deck were divided up 
equally between the other two dimensions. For 
one-third of the Ss the “primary” concept (the 
one most often reinforced) was form, for one- 
third it was color, and for one-third number. 
As an example of this arrangement, with number 
as the primary concept, the correct basis for 
sorting on the first 10 cards varied among Form 
(F), Color (C) and Number (N) as follows: 
N, N, N, F, N, N, C, N, F, N. 
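
A schedule with these proportions can be sketched as follows. The text says only that schedules were prepared in advance; the random shuffle, the seed, and the function name below are assumptions made for illustration.

```python
import random

def make_schedule(primary, n_cards=48, seed=0):
    """Return a list of n_cards sorting bases ('F', 'C' or 'N').

    Two-thirds of the cards reinforce the primary dimension; the remaining
    third is split equally between the other two dimensions.
    The shuffled order is an assumption, not the published procedure.
    """
    others = [d for d in "FCN" if d != primary]
    n_primary = 2 * n_cards // 3
    n_other = (n_cards - n_primary) // 2
    schedule = ([primary] * n_primary
                + [others[0]] * n_other
                + [others[1]] * n_other)
    random.Random(seed).shuffle(schedule)
    return schedule

schedule = make_schedule("N")
print(schedule[:10], {d: schedule.count(d) for d in "FCN"})
```

With 48 cards this gives 32 reinforcements of the primary concept and 8 for each of the other two.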


Part II


In Part II the speed of abstraction of the 
form, color, and number aspects of the WCST 




cards was compared. It could not be assumed, 
however, that the particular forms represented 
in these cards were equal in discriminability to 
the particular colors or numbers represented, 
i.e., that the perceptual information transmitted 
by each stimulus-cue was the same. In order 
to control for this factor, these forms, colors, 
and numbers had to be presented independently 
of variations in the other dimensions, the 
resultant sorting times serving as a base line 
against which to evaluate differential speed of 
abstraction.² This was the purpose of the
“simple” cards, presently to be described. 

Stimuli.—The cards used in this part of the 
experiment were of three kinds: “simple,” 
“mixed,” and “practice.” 

“Simple” cards: Three sets of 24 simple cards 
were used, as follows: a set of 24 simple-form 
cards, varying only in form and consisting of six 
replicas of each of the four geometric shapes of 
the WCST cards, centered on the card in solid 
black; a set of 24 simple-color cards, varying 
only in color and consisting of six replicas of 
each of the four colors of the WCST, in the form 
of squares centered on the card; lastly, a set of 
24 simple-number cards, varying only in number 
and consisting of six instances of from one to 
four black squares in different configurational 
patterns. 

“Mixed” cards: Six sets of 24 mixed cards 
were used for the determination of differential 
speed of abstraction, two sets to be sorted by 
form, two by color, and two by number, these 
sets being made up of cards from the WCST.³

“Practice” cards: Two sets of 24 practice 
cards were used, these being identical with one 
of the four WCST stimulus cards in every 
respect, so that sorting them did not involve 
abstraction of any one stimulus dimension. 

Each of the above sets of response cards had 
associated with it a set of four stimulus cards. 
For the simple card sets these contained the four 
types of symbols represented in the respective 





² The validity of sorting times as a measure
of the perceptual information transmitted by a 
set of stimuli has been demonstrated, at least 
for the continuum of number, by Crossman (2). 

³ These mixed cards were actually of two
different kinds, representing the “single cue” and 
the “double cue” condition, respectively. A 
card in the single cue sets agreed with the 
stimulus card with which it was to be sorted only 
on the dimension indicated as the basis for 
sorting, while a card in the double cue sets 
agreed with its stimulus card on that dimension 
and on an additional one which changed from 
card to card. But since the results with respect 
to this variable were inconclusive, further men- 
tion of it has been omitted from this paper. 







set (e.g., a black triangle, circle, cross, and star, 
for the simple-form set); for the mixed sets and 
the practice sets the stimulus cards of the WCST 
(cf. Part I) were used. 

All cards were of thin cardboard, 3 × 3 in.

Procedure.—Part II followed directly upon 
completion of Part I. The Ss were divided into
three groups, according to order of sorting on the 
three dimensions (FCN, CNF, and NFC). 

All Ss started with one of the practice sets, 
followed by a series of six decks of cards in 
different orders. For Group FCN the order 
was: simple-form; mixed, to be sorted by form; 
simple-color; mixed, to be sorted by color; 
simple-number; mixed, to be sorted by number. 
The order was changed accordingly for Ss in the 
other groups, simple-card sorting for each di- 
mension always preceding mixed-card sorting. 
Lastly, all Ss received the second of the practice 
sets. 

The S was asked to sort each deck according 
to the dimension indicated by E, and to do so as 
fast as possible, consistent with accuracy. 
Sorting times were recorded to the nearest .1 
sec. by means of an electric timer. 

Subjects.—The Ss were 24 male and 24 female
undergraduates, enrolled in psychology courses 
at the University of California. Due to a slight 
revision in experimental procedure for Part II, 
data for that part are based on only 18 Ss of 
each sex. 


RESULTS 


Part I


The main measure in the analysis 
of the results for Part I is the number 
of cards sorted according to the 
primary concept, i.e., the one rein- 
forced on two-thirds of the cards. 
The means of the form, color, and 
number groups for successive blocks 
of 12 trials are presented in Table 1. 
As is shown in this table, Ss responded 
more consistently to the primary cue 


TABLE 1 


Mean Number of Cards Sorted by
Primary Cue








Block of Trials 





1 
8.9 
5.4 
7.1 



















when it was form or number than 
when it was color, particularly in the 
first stages of the task. 

The statistical analysis of these 
data presents certain difficulties, due 
to the uncertain form of the distri- 
bution of the scores. These are 
characterized both by marked hetero- 
geneity of variance and fairly radical 
departure from normality. It was 
therefore decided to resort to a non- 
parametric analysis of variance re- 
cently proposed by Wilson (14), 
based on the number of cases above 
and below the median in each cell. 
Due to the lack of independence of 
the scores for successive blocks of 
trials, this analysis had to be limited 
to the total number of cards out of 
48 sorted by the primary cue. The 


resultant χ² for cues was 9.50, significant
beyond the .01 level.⁴

In order to corroborate these find- 
ings, the same analysis of variance 
was carried out on the maximum 
number of successive cards sorted by 


the primary cue for each S. The 
median length of such runs for each 
cue was: Form, 13; Color, 9.5; 
Number, 14. The corresponding χ²
for cues is 12.27, significant beyond 
the .01 level. 
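
The two χ² values can be checked against the .01 level directly, and the above-versus-below-median idea can be illustrated with a simple one-way median test. Both parts of the sketch rest on assumptions: the tail calculation assumes 2 degrees of freedom for the three cues, and the median test shown is a simplified one-way analogue, not Wilson's full analysis of variance procedure. The scores are hypothetical.

```python
import math
from statistics import median

def chi2_sf_df2(x):
    """Upper-tail probability of chi-square with 2 df (exact closed form)."""
    return math.exp(-x / 2.0)

def median_test(groups):
    """Chi-square for counts above vs. not above the pooled median, by group."""
    pooled = [score for g in groups for score in g]
    med = median(pooled)
    n = len(pooled)
    above_total = sum(score > med for score in pooled)
    chi2 = 0.0
    for g in groups:
        above = sum(score > med for score in g)
        for observed, column_total in ((above, above_total),
                                       (len(g) - above, n - above_total)):
            expected = len(g) * column_total / n
            chi2 += (observed - expected) ** 2 / expected
    return chi2

# Reported values from the text, assuming df = 2.
for value in (9.50, 12.27):
    print(value, round(chi2_sf_df2(value), 4))

# Hypothetical totals (cards sorted by primary cue) for three small groups.
form, color, number = [40, 38, 35, 30], [28, 25, 33, 22], [39, 36, 31, 27]
print(round(median_test([form, color, number]), 2))
```

Under the 2-df assumption both reported values fall below a tail probability of .01, in agreement with the text.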

On both measures, then, color is 
less consistently responded to than 
either form or number when these 
are the primary reinforced cues, while 
form and number appeared to be 
about equally effective.⁵


⁴ Although it is felt that the nature of these
scores demands the nonparametric analysis, 
mention should be made of the results obtained 
from a Fisherian analysis of variance on these 
same scores. This yielded an F for sorting 
categories of 2.91, with 3.22 being required for 
the .05 level of significance. A subsequent t 
test for the difference between form and color 
resulted in a t of 2.33, significant between the 
.05 and .02 levels.

⁵ A breakdown of the data by sex discloses
that for the male Ss the response to number
actually exceeded that to form in consistency in
the last half of the task, with a mean of 21 out
of the last 24 cards being sorted by number,
compared to 19.5 sorted by form. This is of
interest in the light of Grant and Curran’s (3)
finding that number sorting gives rise to a dis-
proportionate amount of errors of perseverance
on the WCST, after the correct sorting category
is shifted. However, the female Ss in the
present sample failed to show any similar ad-
herence to number as a primary cue. Since the
data, moreover, do not permit a trend analysis,
the results can only be held to be suggestive as
regards this possible characteristic of number.


It may perhaps aid in the understanding of
these results to give a brief qualitative descrip-
tion of the mode of performance most commonly
observed in this situation. After the first two
or three nonreinforced cards, S would usually
try out a number of different hypotheses in the
search of one which might pay off more con-
sistently. These consisted not only of the three
main cues, but also of combinations of these, as
well as of positional and configurational cues.
As the sorting went on, however, the primary
cue was usually responded to more consistently
regardless of nonreinforcements, this being es-
pecially true for form and number (thus the
rather high averages on the maximum-run
measure). As a whole, this type of response
bears rather little resemblance to the probability-
matching typically observed in two-choice ex-
periments with probabilistic reinforcement.
The difference is due largely, one suspects, to
the much larger variety of possible hypotheses
in this situation and the opportunity given to S
to test them, by letting him sort at his own rate.


Part II


The results from Part II are sum-
marized in Table 2. With respect to
the simple-card sorting times, the
significant differences among cues
presumably reflect differences in dis-
criminability of the particular in-
stances of each category represented
in the WCST cards. These differ-
ences clearly contaminate the mixed-
card sorting times, which are thus of
little value in themselves. Attention
will be focused, therefore, on the
“abstraction scores.” These were ob-
tained for each S by subtracting his
simple-card times from his mixed-card
times. The resulting difference scores
indicate the extent to which the


TABLE 2 


Mean Sorting Times (in Sec.) and
Abstraction Scores, by
Sorting Category


Sorting | Simple | Mixed 
Category | Cards 
4.9 
19.8 
24.5 


Form 
Color 
Number 


56.8* 


F for cues 


*P < .01.





irrelevant dimensions interfered with 
sorting on a given category, i.e., the 
relative difficulty of the abstraction 
of these stimulus attributes. They 
cannot, however, be taken in any 
absolute sense (i.e., where negative, 
as indicating facilitation from the 
other dimensions), due to probable 
time-order effects of proactive facili- 
tation from the simple cards to the 
mixed cards. 
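
The abstraction score is a simple difference, which the following sketch computes for one S. The sorting times are hypothetical values chosen for illustration and are not taken from Table 2.

```python
def abstraction_scores(simple, mixed):
    """Mixed-card time minus simple-card time (sec.) for each sorting category."""
    return {cue: mixed[cue] - simple[cue] for cue in simple}

# Hypothetical sorting times (sec.) for a single S.
simple_times = {"form": 20.0, "color": 19.5, "number": 24.0}
mixed_times = {"form": 22.5, "color": 18.0, "number": 24.0}

scores = abstraction_scores(simple_times, mixed_times)
print(scores)
```

As the text cautions, only the ordering of such scores across categories is meaningful; a negative value need not indicate facilitation, since the simple cards were always sorted first.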

As seen in Table 2, these abstrac- 
tion scores show unequivocally that 
color is more easily abstracted than 
form, in contrast to the findings from 
Part I. Number occupies here an 
intermediate place, although much 
closer to color than to form. 


With respect to the number cue, there is 
actually some reason to doubt that the abstrac- 
tion scores represent an altogether valid measure 
of relative ease of abstraction. It seems likely 
that the rather high simple-number times were 
due to some extent to the 2-4-3-1 order in which 
the stimulus cards were arranged (this being 
necessary in order to prevent Ss from ignoring 
the stimulus cards when sorting for number, as 
they might do if these were arranged in numerical 
order). Although this factor was still operating 
in the mixed-card sorting, the practice on the 
preceding simple-number cards probably served 
to weaken it considerably, which may help 
account for the mean abstraction score of zero 
for number. 


Discussion 


It is apparent that there is little if any 
correspondence between the results of 







the two parts of this experiment, par- 
ticularly as regards the relative position 
of color and form. In the first part, 
form is “dominant” over color, in the 
sense of representing a more stable and 
consistent conceptual category. In the 
second part, on the contrary, color is 
considerably easier to abstract. 

This latter finding is, incidentally, in 
line with that obtained in a previous 
study by Hamlin, Stone, and Moskowitz 
(5), although in this study differences in 
discriminability of the form- and color-instances
used were not controlled. The
most plausible explanation for the
superiority of color probably lies in 
perceptual factors inherent in these 
stimulus dimensions, such as the re- 
dundancy in the stimulus with respect 
to the relevant dimension, this being 
clearly greater for color than for form. 

Yet the fact remains that in a prob- 
lem-solving situation such as Part I of 
the experiment, or a task like the WCST, 
as well as in Heidbreder’s “modified 


memory” series (10) and other concept-learning
experiments (e.g., 12), form is a
preferred aspect of the stimulus to which 
to respond. Although the explanation 


for this dominance of form is far from 
clear, it seems reasonable to attribute it 
to the operation of response-sets which 
the individual brings to the situation and 
which have been formed during the 
course of his mental development and 
experience. Some support for such a 
view derives from genetic studies of 
color-form preference, which seem to 
point to a change from color- to form- 
dominance in early and later childhood 
(cf. 1, 13). 

The relevance of the present results to 
Heidbreder’s theoretical formulation of 
the conceptual process deserves discus- 
sion. One might take them to support 
Heidbreder’s distinction between situa- 
tional and organismic determinants of 
conceptual performance, although the 
operational basis for such a distinction 
has yet to be established. The important 
point to note, in any event, is that if one 
may speak of an organismic order of 
dominance of concepts at all, it must be 




considered one of conceptual rather than 
perceptual processes. 

Perhaps the most questionable aspect 
of Heidbreder’s conception, however, is 
her notion of the “thing-character” of a 
concept as the basis for this order of 
dominance. This is most apparent in 
regard to the place of number, a con- 
ceptual category relegated to the bottom 
of the hierarchy by Heidbreder, due to 
the inherently symbolic character she 
attributes to it (7, pp. 218-220). Ac- 
tually, the results of Part I show the 
number category to be closely com- 
parable to form, especially after the 
initial stage of the task, as witness the 
medians on the maximum-run measure. 

In defense of Heidbreder it may be 
argued that the WCST cards do not 
contain instances of concrete objects, 
with respect to which her thing-character 
hypothesis was most consistently con- 
firmed in her own experiments. But 
the difficulties encountered in explaining 
the unstable place of color in her results 
(cf. 11, pp. 65-66), as well as the above- 
mentioned conflicting evidence with 
respect to number, would seem to cast 
doubt on the adequacy of “thing- 
character” as an explanatory concept of 
more general applicability. 

In conclusion, the marked discrepancy 
between the results of the two parts of 
this experiment suggests the advisa- 
bility of distinguishing more clearly than 
has usually been done heretofore between 
two evidently rather different cognitive 
processes: one, abstraction, being a se- 
lective response to a given aspect of a 
stimulus which is probably involved to 
a greater or lesser extent in any act of 
perceptual discrimination; the other, 
conceptualization, being perhaps more 
profitably considered as a process of 
mediated generalization, and as such 
being primarily a function of the prior 
experience and stage of mental develop- 
ment of the individual. 


SUMMARY 


The relative strength of response to the form, 
color, and number dimensions of the Wisconsin 
Card Sorting Test cards was compared in two 







tasks: a problem-solving type task using proba- 
bilistic reinforcement of these cues, and a per- 
ceptual abstraction task involving speed of 
sorting on the three stimulus dimensions. 

The results indicated that when form or 
number were the primary reinforced concepts, 
they were more consistently responded to than 
when it was color. On the other hand, form 
was much less readily abstracted than color or 
number in the second task, this being true even 
after differences in discriminability of the par- 
ticular colors, forms, and numbers represented 
in these cards had been taken into account. 

These contrasting results suggest the need for 
differentiating between a perceptually deter- 
mined process of abstraction and a supra- 
perceptual process of conceptualization. With 
respect to the latter, certain difficulties for 
Heidbreder’s formulation of a dominance- 
hierarchy of concepts based on “thing-character” 
are pointed out, the superior showing of number 
being especially noteworthy. 


REFERENCES 


1. Brian, C. R., & Goodenough, F. L. The
relative potency of color and form perception
at various ages. J. exp. Psychol.,
1929, 12, 197-213.

2. Crossman, E. R. F. W. The measurement
of discriminability. Quart. J. exp. Psychol.,
1955, 7, 176-196.

3. Grant, D. A., & Curran, J. F. Relative
difficulty of number, form and color
concepts of a Weigl-type problem using
unsystematic number cards. J. exp.
Psychol., 1953, 43, 408-412.

4. Grant, D. A., Jones, O. R., & Tallantis, B.
The relative difficulty of number, form
and color concepts of a Weigl-type problem.
J. exp. Psychol., 1949, 39, 552-557.

5. Hamlin, R. M., Stone, J. T., & Moskowitz,
M. J. Rorschach color theories as reflected
in simple card sorting problems.
J. proj. Tech., 1955, 19, 410-415.

6. Heidbreder, E. Toward a dynamic psychology
of cognition. Psychol. Rev.,
1945, 52, 1-22.

7. Heidbreder, E. The attainment of concepts:
II. The problem. J. gen. Psychol.,
1946, 35, 191-223.

8. Heidbreder, E. The attainment of concepts:
VII. Conceptual achievements
during card-sorting. J. Psychol., 1949,
27, 3-39.

9. Heidbreder, E. Experiments by Dattman
and Israel on the attainment of concepts.
J. Psychol., 1952, 34, 115-136.

10. Heidbreder, E., Bensley, M. L., & Ivy, M.
The attainment of concepts: IV. Regularities
and levels. J. Psychol., 1948, 25,
299-329.

11. Heidbreder, E., & Overstreet, P. The
attainment of concepts: V. Critical
features and contexts. J. Psychol., 1948,
26, 45-69.

12. Kendler, H. H., & D’Amato, M. F. A
comparison of reversal and nonreversal
shifts in human concept formation behavior.
J. exp. Psychol., 1955, 49,
165-174.

13. Lindberg, B. J. Experimental studies of
colour and non-colour attitude in school
children and adults. Copenhagen: Levin
& Munksgaard, 1938.

14. Wilson, K. V. A distribution-free test of
analysis of variance hypotheses. Psychol.
Bull., 1956, 53, 96-101.


(Received April 26, 1956) 





Journal of Experimental Psychology 
Vol. 53, No. 5, 1957 


THE EFFECT OF DRIVE REVERSAL ON LATENCY, 
AMPLITUDE, AND ACTIVITY LEVEL¹


ROBERT H. DAVIS²
Michigan State University 


In a study recently reported by 
Deese and Carpenter (4), it was incidentally
observed that shifting
animals from a low drive to a high 
drive produced a sudden drop in 
running time, but that shifting 
animals from a high drive to a low 
drive failed to produce a significant 
increase in running time. This uni- 
lateral effect due to shift in drive level 
is called here the drive-reversal effect. 
The present study is concerned with 
the drive-reversal effect for thirst as 
well as hunger and with respect to 
three response measures, namely, 
latency, amplitude, and general activity
level.³


METHOD 


Subjects.—The Ss were 36 male albino rats 


from the colony maintained by the Department 


¹ This paper is based upon a portion of a
dissertation submitted to the Department of 
Psychology of Michigan State University in 
partial fulfillment of the requirements for the 
Ph.D. degree. The author is indebted to Dr. 
M. Ray Denny for advice and assistance 
throughout the investigation. 

² Now at The RAND Corp., Santa Monica,
California. 

³ The original investigation (3) was primarily
directed at answering a number of methodo- 
logical questions in research on motivation. 
The main problem was to determine an amount 
of water reward which was equivalent to a given 
amount of food reward as an incentive. In- 
centive value was assessed in terms of general 
activity level, and the latency, amplitude, and 
resistance to extinction of an instrumental 
response. For all of these measures, it was 
found that .08 gm. of food was approximately 
equivalent to .20 cc. of water for both high and 
low drive. With respect to latency and activity 
level this is indicated in Tables 1 and 2. Re- 
liability estimates in the form of correlation 
coefficients for activity and force of response 
were as follows: activity .62, and force .86. 


of Psychology of Michigan State University. 
Age ranged from 85 to 125 days, with a mean of 
approximately 90 days. 

Apparatus.—The apparatus permitted the 
joint measurement of three aspects of response: 
(a) latency, (b) amplitude or force, and (c) prior 
activity level. The measures of activity level 
and force differed from previous ones in that the 
force of a skeletal rather than an autonomic 
response was measured, and in that activity 
level was measured in the same apparatus as the 
other responses were. 

Figure 1 presents a cross section of the 
apparatus. At the bottom of the activity 
chamber there was a false floor supported by 
three springs and a rubber ball at its exact 
center. Small rubber balls were attached to 
the corners of this false floor and prevented the 
floor from tipping more than } in. The interior 
of the chamber was painted flat gray and was 
illuminated by a 7-w. bulb located at one end 
of the box and covered with opal flashed glass. 
A guillotine door at the other end of the activity 
chamber, when raised, gave access to a hinged
4-in. × 2-in. push-panel and activated a Standard
Electric timer.

Fig. 1. Cross-sectional drawing of
experimental apparatus.

Activity level was measured by a 115-v.
impulse counter connected in series to a GE
2-36KRI mercury switch which was suspended
beneath the floor in such a manner that movement
by S caused the mercury in the tube to
move, momentarily making and breaking the
circuit in accordance with the strength and the
number of the movements. The initial force of
the push-panel was determined in the following
manner. A thin, metal rod, hinged at the top
of the push-panel, was twisted so as to extend to
the back of the panel in one direction and to the 
top of the box in the other. While the lower 
half of the rod “rode” back on the push-panel 
as it was opened, the upper half swung against 
a slender stick of wood which was attached to a 
light, plastic wheel mounted on a well-oiled axle. 
The initial force applied to the door was thus 
transmitted to the wheel and measured in degrees 
of rotation. Because the wheel offered very 
little resistance to the metal rod, the initial 
movement of the push-panel caused it to turn 
out of range of further movements of the metal 
rod. When the push-panel was opened approxi- 
mately #y in., the clock stopped. In this 
manner the latency of the panel-pushing response 
was measured. 

The food reward pellet was placed in a tray 
which was located approximately } in. below the 
lower panel jamb. Water reward was ad- 
ministered at the same point through a curved 
tube from a water bottle which was retractable. 

Habituation.—All Ss were handled for three 
days prior to the habituation phase of the 
experiment. Up to this time and during 
handling, unlimited food and water were 
available in the home cage. The Ss were 
divided into two major groups: a thirst group 
(N = 12) and a hunger group (N = 24). Each 
day for five days, Ss from both groups were 
placed individually in the activity chamber of 
the box for a 6-min. period, and their activity 
was recorded. On Day 3 the Ss received their 
initial deprivation. The hunger group was
deprived of food for 23½ hr., and the thirst group
was deprived of water for the same period. On
Days 4 and 5, the hunger Ss were placed in 
individual feeding cages where, after a 6-min. 
delay, reward pellets were made available for 
15 min. These pellets were followed by a wet 
mash composed of 60% water and 40% ground
Purina dog chow by weight which was made 
available for an additional 15 min. Once or 
twice during this period, the mash was removed 
from the feeding cages and stirred. Given more 
time the Ss refused to eat, and so at the end of 
this 30-min. period all Ss were returned to their 
home cages where water was available for the 
next 23½ hr. but not food.

On Days 4 and 5, the thirsty Ss were placed 
in the individual feeding cages with Purina 
pellets immediately available and with water 
made available after 6 min. These Ss remained 
in the individual feeding cages for a period of 
45 min. with water and dry food and were then 
returned to their home cages where food but no 
water was available for the next 23¼ hr.

Training.—Once per day, for the next three 
days, each S was placed individually in the 
activity chamber for a period of 6 min. All Ss
at this time were under a 23½-hr. drive. At the
close of this activity period, the push-panel was 
presented according to the following schedule: 
Day 1, door fully open for 8 trials and door } in. 
open for 2 trials; Day 2, door fully open for 2 
trials and door } in. open for 8 trials; and Day 
3, door } in. open for 2 trials and door fully 
closed for 8 trials. 

Hungry Ss were presented on each of these 
trials with reward pellets weighing .20 gm. 
Thirsty Ss were given access to the drinking 
tube for 5 sec. per trial (.20 cc.). Following
each day’s training, Ss were treated as on Days 
4 and 5 of the habituation period. 

Test period.—Each S was next given a total 
of 40 test trials, 20 successive trials (10 per day) 
at a low-drive level and 20 successive trials (10 
per day) at a high-drive level. The blocks of 
high- and low-drive trials were counterbalanced. 
Half of each group was tested under high drive 
first and then shifted to low drive (H-L), and 
the other half was tested first under a low drive 
and shifted to a high (L-H). 

In order to insure that Ss operating under a 
low drive would respond for the full 10 trials 
per day, it was necessary to determine empiri- 
cally the minimum deprivation time for this to 
occur. Using animals other than the Ss of the 
present study, this value was determined to be 
5 hr. for water deprivation and 8 hr. for food 
deprivation. It should be noted that this 
privation is essentially from an ad libitum diet. 
Hungry Ss under low drive were fed to satiation 
with water available every running day 8 hr. 
before testing and 6 min. after testing. Thirsty
Ss under low drive were watered to satiation 
with food available every running day 5 hr. 
before testing and 6 min. after testing. High 
drive for hungry Ss was arbitrarily set at 48 hr. 
of food deprivation. These Ss were fed to 
satiation every running day 6 min. after the 
test trials. High drive for thirsty Ss was set at 
36 hr. of water deprivation. These Ss were 
watered to satiation with food available 6 min. 
after test trials every running day, and then 
again at noon on the following day and run 36 
hr. later. The satiation procedure was the same 
as that described for Days 4 and 5 of the habituation
period. For reasons previously mentioned,⁴
the hunger group was divided into two
equal subgroups: a large reward group (.32 gm.)
and a small food reward group (.08 gm.). All
Ss were run between the hours of 8 p.m. and
1 a.m.


⁴ See Footnote 3.


RESULTS


Fig. 2. Latency in seconds of H-L and L-H Ss during last 8 training trials and
20 preshift and 20 postshift trials. (Axes: mean latency in sec. against trials.)


The latency measure.—Figure 2 presents
a plot of the mean latencies for
the last 8 training trials and the 40
test trials (preshift and postshift) for
all 36 Ss. In order to smooth the
curves, latencies on Test Trials 1 and
2 through 39 and 40 were paired, and
the mean latency of each pair was

TABLE 1 


Mean Latency/Trial in Seconds for
20 High-Drive and 20 Low-Drive
Trials


(N = 6 each group) 


High Drive Low Drive 


Reinforcement |—— —_ — 


Mean Mean | 


2.61 25 
3.42 1.26 


2.65 
2.51 


84 
8.90 3.68 
4.41 | 1.88 


2.49 | 
| 








plotted. These test trial curves re- 
veal a trend similar to that found by 
Deese and Carpenter (4). Animals 
shifted from low drive to high drive 
showed a marked decrease in latency, 
whereas Ss switched from high drive 
to low drive continued to perform at 
about the same level as obtained prior 
to drive reversal. 
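
The pairing used to smooth the latency curves can be sketched as follows. This is a minimal illustration only: the function name `pair_means` and the example latencies are hypothetical and are not data from the study.

```python
from statistics import mean


def pair_means(latencies):
    """Average successive non-overlapping pairs of trials: (1, 2), (3, 4), ..."""
    if len(latencies) % 2 != 0:
        raise ValueError("expected an even number of trials")
    return [mean(latencies[i:i + 2]) for i in range(0, len(latencies), 2)]


# Hypothetical latencies in seconds for 40 test trials (illustration only).
example_latencies = [3.0, 2.0, 4.0, 2.0] * 10
smoothed = pair_means(example_latencies)
print(len(smoothed), smoothed[:2])
```

Forty test trials reduce in this way to twenty plotted points per curve.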

The mean latency scores for each
of the reinforcement subgroups are
presented in Table 1. In two of the
three subgroups, latencies decreased 
rather than increased when Ss were 
shifted from high drive to low drive. 
The difference between the mean 
high-drive latency and the mean low- 
drive latency for each of the two drive 
shift conditions (H-L and L-H) was 
tested using a matched t. The mean 
high-drive latency of L-H Ss was 
significantly shorter than the mean 
low-drive latency of the same population
(t = 5.8, P < .01). However,
there was no significant difference 
between the mean high-drive latency 
and the mean low-drive latency of
H-L Ss (t = .036). The same results
were obtained when a sign test was 
used for these comparisons. 
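
A matched t of the kind described above can be sketched as follows, assuming each S contributes one mean high-drive and one mean low-drive latency. The function name `matched_t` and the four pairs of scores are hypothetical; the paper does not report individual scores.

```python
from math import sqrt
from statistics import mean, stdev


def matched_t(x, y):
    """Return (t, df) for paired observations x[i], y[i] from the same S."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two matched pairs")
    diffs = [a - b for a, b in zip(x, y)]
    # Standard error of the mean difference; zero if all differences are equal.
    se = stdev(diffs) / sqrt(len(diffs))
    return mean(diffs) / se, len(diffs) - 1


# Hypothetical mean latencies in seconds for four Ss (illustration only).
low_drive = [2.0, 4.0, 5.0, 8.0]
high_drive = [1.0, 2.0, 3.0, 4.0]
t, df = matched_t(low_drive, high_drive)
print(round(t, 3), df)
```

The resulting t is referred to the t distribution with N - 1 degrees of freedom, where N is the number of matched pairs.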

The activity measure.—Figure 3 
presents a plot of the mean activities 
of Ss in the two drive shift groups on 
each of the four test days. As in 
the case of the latency measure, a 
trend toward drive reversal was 
apparent. For each of the drive 
shift groups, a matched t was com- 
puted to compare activity when 
measured under high-drive and under 
low-drive conditions. A significant
increase in activity was obtained 
when Ss were shifted from low drive 
to high drive (t = 5.6, P < .01), but 
when Ss were shifted from high drive 
to low drive, the decrease in activity 
was not significant (t = .167). When 
a composite curve (see Fig. 3) was
drawn based on these data, a consist- 
ent rise in activity was obtained over 
the four test days. The mean activi- 
ties measured under the various drive 
shift and reinforcement conditions are 
presented in Table 2. 


Fig. 3. Mean activity counts on each of the
4 experimental days of Ss in the 2 drive shift
groups. Continuous curves based on combined
data of both groups. (Legend: L-H, H-L, combined;
axes: mean activity counts against test days.)


TABLE 2 


Mean Activity per Test Day at High-Drive
and Low-Drive Levels


(N = 6 each group) 








High Drive 








496 
811 





582 
748 





638 
609 


647 














The amplitude measure.—Obvious 
non-normality in the distribution of 
the amplitude scores required the 
application of a nonparametric sta- 
tistic to the data. Mean high-drive 
amplitudes and mean low-drive ampli- 
tudes were compared using the sign 
test (6). When the drive shift pro- 
cedure was ignored, mean high-drive 
amplitude scores exceeded mean low- 
drive amplitude scores in only 19 of 
the 36 cases, and the null hypothesis 
was accepted. Drive, in other words, 
did not significantly affect the ampli- 
tude of response. When these data 
were broken down in terms of the 
drive shift procedure employed, the 
null hypothesis was accepted in both 
instances. Mean high-drive scores 
exceeded mean low-drive scores in 
only 8 out of 18 H-L cases and 11 out 
of 18 L-H cases. Although both of 
these differences were in the direction 
which would be predicted on the basis 
of the previously reported drive- 
reversal data, neither was significant. 
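
The sign-test comparisons above can be checked with an exact binomial calculation. This is a sketch under stated assumptions: a two-sided test with no ties, computed directly rather than from the tables in (6). The function name `sign_test_p` is hypothetical.

```python
from math import comb


def sign_test_p(k, n):
    """Two-sided exact sign-test probability for k 'plus' signs out of n pairs."""
    lower = sum(comb(n, i) for i in range(0, k + 1)) / 2 ** n  # P(X <= k)
    upper = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n  # P(X >= k)
    return min(1.0, 2 * min(lower, upper))


# Counts reported in the text: all 36 Ss, the 18 H-L Ss, and the 18 L-H Ss.
for k, n in [(19, 36), (8, 18), (11, 18)]:
    print(k, n, round(sign_test_p(k, n), 3))
```

Each of the three probabilities exceeds .05, in agreement with the acceptance of the null hypothesis reported above.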


Discussion 


The drive-reversal effect was clearly 
obtained in the case of latency and
activity. However, since there was no
difference between high- and low-drive 
amplitudes, the conditions for a drive- 
reversal effect did not even exist with 
respect to this measure. Although Hull 
has stated that response amplitude is 
one of the aspects of behavior which 
varies as a function of drive, the evidence 
for this is based primarily on autonomic 
measures (7, 8). Since the measures 
utilized in the present study were 
skeletal, the results do not directly 
contradict the evidence on which the 
assumption is based. 

Within the original theoretical frame- 
work of this study, there are two possible 
explanations for the drive-reversal effect. 
One of these is the possibility of residual 
motivation. However, since the Ss in 
the present study were completely sati- 
ated twice in the 24-hr. period following 
the shift from high drive to low drive, it 
appears unlikely that residual motivation 
could account for the continued high 
level of performance of Ss shifted to low 
drive. Moreover, between Days 1 and 
2, low-drive Ss increased their mean 
activity count by 24 while high-drive Ss
showed a decrease in activity for the
same period. If residual motivation
were building up, we would expect it to
reflect itself in a greater increase in
activity in those Ss undergoing the
greatest deprivation, which is not the
case.

The second possible explanation for 
the drive-reversal effect and one which 
appears more adequate is based on 
Hull’s secondary motivation corollary 
(8, p. 25). This corollary states that 
neutral stimuli, which are repeatedly 
and consistently associated with the 
evocation of a drive, acquire the capacity 
to bring about the drive stimuli. In 
the present study, the abrupt diminution 
in drive was introduced after a large 
number of trials had been administered. 
Presumably, many of the neutral stimuli 
in the box could be conditioned to the 
primary drive stimuli during these trials 
and these previously neutral stimuli 
could become the basis for the continued 
high level of performance of low-drive 
Ss in the postshift phase of the study. 




The most convincing bit of evidence in 
favor of this explanation is to be found 
in Fig. 2. An inspection of this figure 
reveals that the unilateral drive-reversal 
effect did not occur when Ss were shifted 
from an intermediate drive to either a 
high drive or a low drive following the 
training phase of the experiment. The 
fact that no drive-reversal effect was 
apparent at this point in the study and 
the fact that it became apparent 20 
trials later, supports the hypothesis that 
the neutral stimuli were acquiring a 
motivating property during this period. 

A third interpretation, which, theoretically,
may stray a bit from the other
two, should also be mentioned. It is
possible to assume that the responses 
being conditioned under high drive 
differ from the responses being condi- 
tioned under low drive. Under high 
drive the panel-pushing (food approach) 
response is more consistently elicited and 
there are fewer competing responses 
occurring (2). Once this approach re- 
sponse is conditioned under high-drive 
conditions, it is tied to the cues of the 
situation and remains strong even when 
the drive is reduced. On the other hand, 
under low drive no response is being 
consistently elicited or strongly condi- 
tioned (5) and when the conditions are 
shifted to high drive the food approach 
response becomes prepotent which is 
reflected immediately in lowered la- 
tencies. This explanation also fits the 
data of Fig. 2 as the preshift conditioning 
would be the same for Ss shifted both 
up and down. Such a view is related to 
recent findings which indicate that the 
habit for pressing a heavily weighted bar 
in a bar-pressing situation is not the 
same as the habit for pressing a light bar 
(9, 10) and that extinction rate is a
function of the particular response
elicited by frustration (1).


SUMMARY 


The drive-reversal effect, i.e., the tendency 
of animals shifted from high to low drive to 
continue performing at the same level as ob- 
tained prior to drive shift, was investigated with 
respect to three performance measures—latency,
activity, and response amplitude. Thirty-six
Ss were trained in a panel-pushing apparatus at 
an intermediate drive level. The Ss were then 
given 20 trials at a high-drive level and 20 trials 
at a low-drive level in accordance with one of 
two drive-shift procedures: H-L or L-H. A 
drive-reversal effect was obtained in the case of 
latency and activity but not in the case of 
response amplitude. Several alternative ex- 
planations for this effect were discussed. 


REFERENCES 


1. Adelman, H. M., & Maatsch, J. L. Resistance
to extinction as a function of the
type of response elicited by frustration.
J. exp. Psychol., 1955, 50, 61-65.

2. Cotton, J. W. Running time as a function
of food deprivation. J. exp. Psychol.,
1953, 46, 188-198.

3. Davis, R. H. The quantification of drive:
I. Incentive values of food and water.
Unpublished doctoral dissertation, Michigan
State Univer., 1953.

4. Deese, J., & Carpenter, J. A. Drive level
and reinforcement. J. exp. Psychol.,
1951, 42, 236-238.

5. Denny, M. R., & Adelman, H. M. Elicitation
theory: I. An analysis of two typical
learning situations. Psychol. Rev., 1955,
62, 290-296.

6. Dixon, W. J., & Massey, F. J. Introduction
to statistical analysis. New York:
McGraw-Hill, 1951.

7. Hull, C. L. Principles of behavior. New
York: D. Appleton-Century, 1943.

8. Hull, C. L. Essentials of behavior. New
Haven: Yale Univer. Press, 1951.

9. Maatsch, J. L., Adelman, H. M., & Denny,
M. R. Effort and resistance to extinction
of the bar-pressing response. J.
comp. physiol. Psychol., 1954, 47, 47-50.

10. Stanley, W. C., & Aamodt, M. S. Force
of responding during extinction as a
function of force requirement during
conditioning. J. comp. physiol. Psychol.,
1954, 47, 462-464.


(Received April 26, 1956) 







RESPONSE STRENGTH AS A FUNCTION OF
DELAY OF REWARD IN A RUNWAY¹


WAYNE B. HOLDER,² MELVIN H. MARX, ELAINE E. HOLDER,
AND GEORGE COLLIER


University of Missouri 


A number of experiments have 
investigated the effect during acqui- 
sition of delay of reinforcement on 
response strength. In general the 
results have indicated that both the 
rate and asymptote of learning are 
some decreasing function of the length 
of delay and of the distance of the 
delay from the reward. 

A second, motivational, effect is 
sometimes ascribed to delays of rein- 
forcement occurring in response chains 
having some strength. For example, 
Amsel (1, 2) and Brown and Farber 
(4) have suggested that delay of 
reinforcement is one of the conditions 
that produce frustration. Such frus- 
tration is assumed to have the func- 
tional status of an irrelevant drive 
and to strengthen any response which 
accompanies or follows the frustrating 
situation by increasing the general 
motivational level of the organism. 
It is also assumed that the frustration- 
aroused drive has typical drive- 
stimulus properties, and further that 
any stimulus which has been asso- 
ciated with the frustration situation 
acquires conditioned motivational 
properties which will operate to in- 
crease response strength in other 
situations. It has been suggested
that the amount of frustration produced
is some function of the length
of delay.

¹ This investigation was supported in part by
a research grant, M-817, from the National
Institute of Mental Health, of the National
Institutes of Health, Bethesda, Maryland. This
experiment was planned and executed by the
first author under the general supervision of the
second with assistance from the third; the fourth
author is primarily responsible for the statistical
analysis and the form of the present report.

² Now at Fresno State College, California.

Most of the preceding studies of 
delay have examined its effects when 
it closely preceded the goal response, 
and have not employed post-delay 
measures of response strength. 

The present study examines, fol- 
lowing acquisition, the changes in 
response strength before and after 
delay as a function of the length of 
delay, and the effect of a stimulus 
associated with delay of reward on 
the acquisition of response strength 
in a different situation. It was 
designed to test some of the impli- 
cations of the Brown and Farber (4) 
treatment of frustration. 


METHOD 


Apparatus.—The first runway consisted of a
90-in. unpainted, straight alley 5} in. deep and 
3 in. wide; the length included a 12-in. starting 
box, a 12-in. goal box, and a 14-in. delay box 
halfway between the starting box and the goal 
box. Natural wood doors were at both ends of 
the delay box and there was a Plexiglas door in 
front of the second delay box door. When the 
delay box doors were raised the delay box was 
not distinguishable in the runway. The starting
box and goal box doors were dark brown tem- 
pered masonite. 

A contact switch on the starting box door 
started a Standard Electric timer when the door 
was opened. When S broke a photoelectric beam
6 in. from the starting box door, this timer stopped 
and a second timer started. The second timer 
was stopped in the same way immediately in 
front of the first delay box door. A third timer 
was started by opening the door leading out of 
the delay box and stopped when S crossed a 
photoelectric beam 6 in. in front of the delay 
box door. A fourth timer was started by this 
photoelectric cell and stopped when S broke a 
beam of light immediately in front of the goal
box door. Both runway segments were 26 in.
in length.

The second runway consisted of a 72-in.
straight alley 5§ in. deep and 3 in. wide; the
length included a 12-in. starting box and a 12-in.
goal box. This entire apparatus was painted a
flat black.

Subjects.—Thirty-nine male albino rats were
used. They were approximately 120 days old
at the beginning of the experiment, at which time
they were placed on a 23-hr. feeding schedule
for 10 days and habituated to E and the
apparatus.

Procedure.—Training trials were given in the
first runway. All Ss were given one training
trial per day for 25 days with approximately 1
sec. delay and a .3-gm. food reward. The
Plexiglas door was down on these trials, forcing
S to stop, and was raised after 1 sec.

Ten test trials, one per day, were then given
in the same runway, with .3 gm. food reward,
under the following conditions: 1-sec. group,
1 sec. delay with the same procedure as used in
the training trials; 15-sec. group, 15 sec. delay;
and 45-sec. group, 45 sec. delay. The wooden
doors of the delay box were lowered for the 15-sec.
and 45-sec. delay groups. A small muffled
doorbell buzzer sounded during the entire delay
interval for the 15-sec. and 45-sec. groups. At
the termination of delay, the wooden doors were
raised first, followed immediately by the raising
of the Plexiglas door. Starting times (ST) and
running times (RT) preceding and following the
delay box were measured on all training and test
trials.

One test trial per day was given for 10 days
in the second runway. The buzzer from the
first series of test trials was mounted on the
starting box of this runway and was sounded
throughout the trials for all three groups.


RESULTS 


First runway (delay of reward).—
An analysis of variance was performed
on Log 1/ST³ on the last five training
trials for delay group (D), trials (T),
and the first and second runway segment
(R). T and R proved to be
significant, P(F_T = 3.148; 4, 144)
< .025 and P(F_R = 92.076; 1, 36)
< .001, respectively. These results
indicate that with respect to starting
times the rats had failed to reach
asymptote and that the starting times
from the start box were significantly
shorter than those from the door of
what was to become the delay box.
The data of this analysis are presented
in the first and third panels of Fig. 1.
An analysis of variance on Log 1/ST
for length of delay (D), trials (T),
and runway segment before and after
delay (R) is presented in Table 1.
The main effect of R and the interactions
of R × T and R × D were
significant. The data of this analysis
are presented in the second and fourth
panels of Fig. 1. Tests of trends for
the individual T × R × D combinations
were performed using the
appropriate error term from the
over-all analysis. Both the before


³ Log (RT + 1) and Log 1/ST yielded the
most symmetrical distributions and the most
homogeneous variances of the measures examined
and were therefore used in the analyses.
The first measure decreases, the second increases,
with response strength. The results of the same
analyses on the raw data, RT and ST, and on
the other transforms tried, yielded in every case
the same conclusions, with one very minor
exception. Since the major comparisons among
the curves in Fig. 1 and 2 differ significantly
nonparametrically for both the transformed and
untransformed data, the results of the analysis
of variance are not artifacts of the transformations.
The results were, therefore, summarized
in terms of RT and ST, both of which decrease
with response strength.


Fig. 1. First Runway: Log 1/ST.







TABLE 1 


Analysis of Variance for the First Runway





Source 


Log 1/ST Log (RT +1) 





ms 





Between S’s 
Delay (D) 
Error (b) 


Within Ss 

Trials (T) 

Runway segment (R) 
T × R


Error (w) 
Error, (w) 
Errors(w) 
Errory(w) 


Total 








* P < .025
** P < .005
*** P < .001


and after delay curves for 15-sec. and
45-sec. delays showed significant
trends at least at the .05 level, while
neither of the 1-sec. curves showed a
significant trend. From the plots
and the analysis it can be seen that
ST’s from the start box increased over
trials as a function of length of delay
and ST’s from the delay box decreased
over trials with length of delay. For
the before-delay ST’s there is an
inversion between the 15-sec. and
45-sec. group, the former showing the
longest ST’s. This inversion is nonsignificant
by the analysis of variance,
although by the binomial expansion
the curves differ at the .01 level.
Whether this inversion was due to
the initial, nonsignificant, difference
or represents an experimental difference
could not be determined from
these data. The failure of either of
the 1-sec. curves to show any significant
trend indicates that the ST’s
were near asymptote. The initial
difference in ST’s for the two runway




00227 
00865 


946 
IE bee 
2.819°* 


000157 
.02468 
00072 
00012 
03804 
00032 
00024 
00018 
OO116 
00018 


8D 
21.344*** 
3,.928°** 
687 
32.896*** 
1.795* 


817 
16.253*** 











segments appearing during training 
remained during test. 
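
The two transformations used in these analyses can be sketched as follows. Common (base-10) logarithms are an assumption here, since the report does not state the base, and the function names are hypothetical.

```python
from math import log10


def log_inverse_st(st):
    """Log 1/ST: rises as starting time (in sec.) becomes shorter."""
    return log10(1.0 / st)


def log_rt_plus_one(rt):
    """Log (RT + 1): falls as running time (in sec.) becomes shorter."""
    return log10(rt + 1.0)


# Hypothetical starting and running times in seconds (illustration only).
for seconds in (0.5, 1.0, 2.0, 4.0):
    print(seconds, round(log_inverse_st(seconds), 3), round(log_rt_plus_one(seconds), 3))
```

Because the two measures move in opposite directions as times shorten, a gain in response strength appears as a rise in Log 1/ST and as a fall in Log (RT + 1).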

Fig. 2. First Runway: Log (RT + 1).

An analysis of variance was performed
on Log (RT + 1) on the last
five training trials for delay group
(D), trials (T), and first and second
runway segment (R). Only R proved
to be significant, P(F_R = 29.189; 1,
144) < .001, indicating that Ss were
near asymptote on this measure and
that they ran the second runway
segment significantly more slowly.
The data of this analysis are presented 
in the first and third panels of Fig. 2. 
An analysis of variance on Log 
(RT + 1) for length of delay (D), 
trials (T), and runway segments 
before and after delay (R) is presented 
in Table 1. The main effect of R, 
and the interactions of R and T, 
R and D, and R, T and D, were 
significant. The data of this analysis 
are presented in the second and fourth 
panels of Fig. 2. Tests of trends for
the individual T × R × D combinations
were performed using the
appropriate error term from the over- 
all analysis. Both the before- and 
after-delay curves for the 45-sec. 
group and the before-delay curve for 
the 15-sec. group showed significant 
trends, while neither the 1-sec. curves
nor the curve for the 15-sec. group 
following delay showed any significant 
trends. The trend of the 15-sec. 
curve, following delay, was very nearly 
significant at the .05 level. The 
over-all differences between the 15- 
sec. and 45-sec. groups and the 15-sec. 
and 1-sec. groups for before and after
delay separately were significant at 
least at the .05 level. From the plots 
and the analysis it can be seen that 
RT’s from the start box increased as 
a function of length of delay over 
trials, and RT’s from the delay box 
decreased as a function of length of 
delay over trials. In both cases the 
rate of increase or decrease was also 
a function of length of delay. It may 
further be noted that while there was 
a significant over-all difference in 
RT’s between the two runway seg- 
ments, the 45-sec. group following 
delay reached a level of RT’s signifi- 
cantly faster than that for all other 
segments and groups (cf. Fig. 2). 
The correlations between Log 1/ST
and Log (RT + 1) for Trial 2 and for
Trial 10 before delay were -.738
and -.740, respectively. The same
correlations after delay were -.562
and -.751. There were no significant
group differences for the correlations
computed separately. Linearity
and homoscedasticity were not
rejected.


Fig. 3. Second Runway: Log (RT + 1).
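
A product-moment correlation between the two transformed measures can be sketched as follows. The function name `pearson_r` and the five pairs of scores are hypothetical; they serve only to show why the coefficient is negative when one measure rises and the other falls with response strength.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between paired scores x[i], y[i]."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need at least two paired scores")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical transformed scores for five Ss on one trial (illustration only).
log_inv_st = [0.30, 0.10, 0.00, -0.20, -0.30]
log_rt_1 = [0.40, 0.55, 0.50, 0.70, 0.65]
print(round(pearson_r(log_inv_st, log_rt_1), 3))
```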

Second Runway.—An analysis of 
variance was performed on Log 
(RT + 1) for delay group (D) and 
trials (T). Both T and D proved to 
be significant while the T × D interaction
did not, P(F_D = 4.589; 2, 32)
< .0025, P(F_T = 37.738; 9, 324)
< .001, and P(F_TD = 1.193; 18, 324)
> .05. The data of this analysis are 
presented in Fig. 3. The difference 
between the 15- and 45-sec. delay 
curves was separately tested and 
failed to reach significance. These 
plots and the analysis show that the 
RT during the course of training in 
the second runway is some function 
of the length of delay on the first 
runway. For the number of trials 
run, the curves do not become sig- 
nificantly nonparallel. 


Discussion 


At least four very interesting effects of 
delay introduced into a chain of responses 
at nearly asymptotic strength were apparent
in this study. First, response
strength decreased before the delay. 
Second, response strength increased after 
the delay, apparently to a level exceeding 
that obtained during the nearly asymp- 
totic training sessions. Third, both of 
these effects did not occur immediately 
but grew over trials. Fourth, a stimulus 
associated with the delay interval was 
effective in producing a greater response 
strength in a different situation. 

At least two current points of view 
seem equally well able to account for 
these data. They may be interpreted 
in terms of the hypothesis that delay 
results in an aversive, motivational state, 
whose magnitude is a function of the 
length of the delay, and which can be 
evoked by stimuli associated with the 
conditions producing the state (1). On 
this hypothesis the decrease in response 
strength in the segment preceding delay 
is either the result of the learning of an 
avoidance response to the noxious stimuli 
associated with frustration (4) or the 
development of fractional anticipatory 
frustration response with inhibitory 
properties (1). The increased response 
strength following the delay would be 
accounted for in terms of the irrelevant- 
drive properties of frustration. The 
increased response strength in the second 
runway would be the result of the con- 
ditioning of the irrelevant motivation 
aroused by the delay. This hypothesis 
differs from the usual accounts (e.g., 11, 
19, 20) of the effects of delay which pred- 
icate a decline in response strength as a 
consequence of increased delay of reward. 
Hull (11) accounts for this in terms of a 
motivational variable, J, which multi- 
plies habit strength. Spence (20) offers 
an alternative account relating J to the 
magnitude of the conditioned fractional 
anticipatory goal response and giving it 
associative properties. 

The data may also be interpreted from 
an associative point of view. According 
to this hypothesis, the decrease in 
response strength in the runway seg- 
ment preceding delay is the product of 
extinction resulting from the increased 
delay in reward; that is, the response 
strength falls to a new asymptote commensurate
with the delay of reward.
The increase of response strength in the 
runway segment following delay would 
be the joint product of the adaptation or 
extinction of competing responses to the 
block or the delay box door, the growth 
of strength of orientative and prepara- 
tory responses, similar to those in the 
start box, and the closer proximity of the 
reward in the second segment. Thus 
the effect of the delay would be func- 
tionally to cut the runway in two with 
the consequent shallower gradient of 
reinforcement (12) and faster RT in the 
second segment. The greater response 
strength shown by the delay groups in 
the second runway is due to the buzzer 
becoming a conditioned stimulus for 
“faster” running. 

The data of the first segment seem not to
favor any one account. Avoidance
learning (4), fractional anticipatory frus- 
tration (1), or extinction seem equally 
able to handle the results. Since the 
start box was farthest from reward, 
start-box ST would be expected, on 
ordinary delay-of-reinforcement notions, 
to have gained the least response 
strength, as it apparently did; and there- 
fore, to be least affected by the intro- 
duction of delay. If this reasoning is 
correct, the inversion of the 15-sec. and 
45-sec. groups on ST from the start box
is not disturbing to any of the points of 
view, since it would, then, be most 
sensitive to the operation of extraneous 
factors, such as original group differences, 
which might have been operating. All 
three hypotheses would predict that both 
ST and RT would fall to new, lower 
asymptotes of response strength. The 
associative account is similar to the one 
given by Brown and Gentry (5, 6) for 
the failure of their delay rats to learn the 
pre-delay segment of their maze and 
by Holder (10) in accounting for the increased
RT of rats shifted from 40- to
70- and 100-sec. delays. Cooper (7) and 
Gilhousen (9) have reported similar 
decreases in RT for rats delayed in the 
middle of a maze. 

While there is rough qualitative agree- 
ment, the data of the second segment 
pose a serious difficulty for the irrelevant-motivation
hypothesis. A number of
studies have demonstrated, in varied 
situations, that increased vigor of re- 
sponse follows interruption of, delay of, 
or failure to reward. For example, 
Marzocco (14), Rohrer (16), and Shef- 
field (18) present data in which the 
effect of failure or delay of reward is 
presumed to manifest itself immediately. 
On the other hand, Geier and Tolman 
(8), who obtained measures of activity 
during delay, showed an increasing
effect (on activity) over trials, while 
Logan (13) showed a gradual decline of 
response strength over trials as the 
result of the introduction of an increased 
delay. This growth of response strength 
over trials presents the major difficulty 
for the irrelevant-drive hypothesis. The 
effect of blocking should result, following 
Amsel (2) or Brown and Farber (4), in 
an immediate increment in drive which 
should multiply whatever habits are 
present and produce an immediate 
increment in response strength. Hull’s 
latest formulation (11) would predict 
that the introduction of delay would 
result in an immediate decrement in 
response strength. The spacing of the 
test trials in the present experiment 
precludes any interpretation in terms of 
an accumulation of motivation or of any 
immediate increment, and necessitates 
some sort of associative process with 
respect to the irrelevant motive in order 
to account for the gradual change in 
response strength. Amsel (1) has sug- 
gested one such mechanism in his frac- 
tional anticipatory frustration concept. 
A number of other such explanations 
could be invented (e.g., 17).

The inverse relation between response 
strength and length of delay, with the 
45-sec. group’s RT’s falling below those 
of any other condition, presents the 
major difficulty for the associative ac- 
count. The effect of interrupting a run 
is shown by all groups during training 
where both ST and RT are consistently 
slower following delay. This may re- 
sult, on the present hypothesis, from 
unconditioned (emotional) or condi- 
tioned responses to blocking or to 
“irrelevant” responses to the new stimuli.
Where these responses compete with the
ongoing behavior, a decrement would be 
expected, as in the present study; where 
they do not, facilitation would be ex- 
pected as, for example, in Amsel and 
Roussel (2) or Marzocco (14). Thus 
the differential effects of delay would be 
accounted for in terms of the differential 
amounts of adaptation or extinction of 
competing responses during delay and 
the greater likelihood of a “well-oriented”
rat. Some support for this notion is 
provided by the results of Brown and 
Gentry (5, 6), who found no differences 
in RT in the portion of a maze following 
delay between a control group (not 
delayed) and a group delayed for 45 secs. 
on a delay wheel. The results of Amsel 
and Roussel (2) can also be accounted 
for in this way. Since they failed to 
report the RT’s and ST’s for the first 
segment of their maze and since they 
used each rat as its own control, alternating
reward and nonreward conditions
with the resulting possibility of con- 
founding and obscuring the goal gradient 
and like effects, it is difficult to give a 
clear, post hoc account of their results. 
Resistance to extinction measures, 
which were not possible here, would 
supply useful information with respect 
to the covariation of RT and ST with 
number of trials to extinction. As is 
suggested in the Sheffield (18) and 
Stanley (21) studies, little relation would 
be expected from the irrelevant moti- 
vation point of view; that is, RT and 
ST would presumably be sensitive to 
increased motivation, while habit 
strength as measured by trials to ex- 
tinction, would not. The opposite pre- 
diction would be made by the associative 
hypothesis. Information on the oper- 
ation of secondary rewards in the present 
situation would also be useful in making 
a decision between the two hypotheses. 
One problem posed for both accounts 
is the role of the buzzer used as a CS. 
While the dampened buzzer was prac- 
tically imperceptible to humans, it is 
known that white noise at 120 db. is 
noxious for the albino rat (15). The 
present results might be accounted for 
in terms of “time in the presence of the
noxious stimulus.” The fact that the
1-sec. group, which had not previously 
been exposed to the buzzer, was the 
slowest group in the second runway, and 
that gradual learning curves occurred for 
all groups, argues against this interpre- 
tation of the results. 

Both hypotheses would predict that 
response strength in the second runway 
would be some increasing function of the 
length of delay. The present study and 
that of Bernstein (3) are the only two 
which have attempted to condition the 
effects of delay of reward. 

In both studies the results may be 
interpreted as the conditioning of some 
motivational state or as the conditioning 
of responses or components of responses 
used in the second situation. The latter 
is more difficult in the Bernstein study 
since there was no mediating conditioned 
stimulus. The failure of the 15-sec. and
45-sec. groups to differ in the present
experiment could be accounted for by 
either hypothesis in terms of the lability 
of the conditioning which occurred in 
the small number of pairings. 

While the present data have been 
indecisive with regard to theory, their
clarity sheds much light on the effect of
delay in the runway situation and offers 
a number of interesting leads for further 
research. 


SUMMARY 


Thirty-nine albino rats were trained to near 
asymptote on a straight runway in which each 
run was interrupted in the middle of the runway 
for 1 sec. Following training, Ss were divided 
into three groups. The first group continued 
under the training conditions, the second was 
delayed in the middle of the runway for 15 sec., 
and the third for 45 sec. A buzzer was sounded 
during the delay period for the 15-sec. and 45-sec.
groups. The three groups were then run
on a second runway in which the buzzer which 
had been associated with delay was sounded 
throughout a run. 

On the first runway it was found that response 
strength, as measured both by ST and RT, was 
a decreasing function of the length of delay in 
the segment of the runway preceding the delay, 
and an increasing function of delay in the segment
of the runway following delay. On the
second runway the acquisition curves for RT 
for both of the delay groups fell below that for 
the control. 


Alternative interpretations of the data were 
outlined in terms of two sets of hypotheses: 
(a) that delay of reinforcement occurring in a 
chain of highly trained responses produces an 
aversive motivational state which can be evoked 
by a stimulus associated with conditions pro- 
ducing the state and which is added to the on- 
going motivational complex; and (b) that delay 
of reinforcement extinguishes the members 
preceding it in a response chain, that the in- 
creased vigor of response in the second segment 
is the joint result of differential extinction and 
adaptation of competing responses to the delay, 
and the closer proximity of the reward, and that 
stimuli associated with these responses will tend 
to evoke similar responses in similar situations. 


REFERENCES 


1. Amsel, A. A three-factor theory of inhibition:
   an addition to Hull’s two-factor theory. Paper
   read at the Southern Society for Philosophy and
   Psychology, Roanoke, Va., March, 1951.

2. Amsel, A., & Roussel, J. Motivational properties
   of frustration: I. Effects on a running response
   of the addition of frustration to the motivational
   complex. J. exp. Psychol., 1952, 43, 363-368.

3. Bernstein, B. B. Extinction as a function of
   frustration. Unpublished doctor’s dissertation,
   Univer. of Missouri, 1954.

4. Brown, J. S., & Farber, I. E. Emotions
   conceptualized as intervening variables with
   suggestions toward a tentative theory of
   frustration. Psychol. Bull., 1951, 48, 465-495.

5. Brown, W. L., Gentry, G., & Kaplan, S. J. The
   effect of intra-maze delay. I. Delay enforced by a
   revolving wheel. J. comp. physiol. Psychol., 1948,
   41, 258-268.

6. Brown, W. L., & Gentry, G. The effect of
   intra-maze delay. II. Various intervals of delay.
   J. comp. physiol. Psychol., 1948, 41, 403-407.

7. Cooper, J. B. The effect upon performance of
   introduction and removal of a delay within the
   maze. J. comp. Psychol., 1938, 26, 457-462.

8. Geier, F. M., & Tolman, E. C. Goal distance and
   restless activity. I. The goal gradient of restless
   activity. J. comp. Psychol., 1943, 35, 197-204.

9. Gilhousen, H. C. Temporal relations in
   anticipatory reactions of the white rat. J. comp.
   Psychol., 1938, 26, 163-175.

10. Holder, E. Response strength as a function of
    changes in the length of the delay interval.
    Unpublished master’s dissertation, New Mexico
    State College, 1951.

11. Hull, C. L. A behavior system. New Haven: Yale
    Univer. Press, 1952.

12. Hull, C. L. The rat’s speed-of-locomotion
    gradient in the approach to food. J. comp.
    Psychol., 1934, 17, 393-422.

13. Logan, F. A. The role of delay of reinforcement
    in determining reaction potential. J. exp.
    Psychol., 1952, 43, 393-399.

14. Marzocco, F. N. Frustration effect as a function
    of drive level, habit strength and distribution of
    trials during extinction. Unpublished doctor’s
    dissertation, State Univer. of Iowa, 1950.

15. Poland, R. G., Herstrom, E. A., & Davis, R. T.
    Running speed of rats as a function of auditory
    stimuli. Proc. S. D. Acad. Sci., 1952, 31, 177-181.

16. Rohrer, J. H. A motivational state resulting from
    non-reward. J. comp. physiol. Psychol., 1949, 42,
    476-485.

17. Seward, J. P. Secondary reinforcement as tertiary
    motivation: a revision of Hull’s revision.
    Psychol. Rev., 1950, 57, 362-374.

18. Sheffield, V. F. Resistance to extinction as a
    function of the spacing of extinction trials.
    J. exp. Psychol., 1950, 40, 305-313.

19. Spence, K. W. The role of secondary reinforcement
    in delayed reward learning. Psychol. Rev., 1947,
    54, 1-8.

20. Spence, K. W. Theoretical interpretations of
    learning. In C. P. Stone (Ed.), Comparative
    psychology. New York: Prentice-Hall, 1951.

21. Stanley, W. C. Extinction as a function of the
    spacing of extinction trials. J. exp. Psychol.,
    1952, 43, 249-300.


(Received April 27, 1956) 





Journal of Experimental Psychology
Vol. 53, No. 5, 1957


FACTORS CONTRIBUTING TO GENERAL VERSUS
SPECIFIC PERCEPTUAL LEARNING ¹


J. ALFRED LEONARD,² H. WESTON CLARKE, AND SARA R. STAATS
The Ohio State University 


Harlow’s studies on the formation 
of learning sets (4) have revived 
interest in the “increments of per- 
formance that occur as a subject 
passes from sample to sample of a 
single class of material” (6, p. 307). 
In general, it has been found that 
experience with and training on one 
sample of a class facilitates learning 
on another sample of the same class. 

We became interested in these 
findings for two separate though 
related reasons. First, although the 
phenomenon is one of the best 
established in the learning literature, no
attempts seem to have been made to 
analyze it in detail. For instance, 
transfer between samples has usually 
been measured by reference to the 
saving relative to initial performance, 
but not with reference to the per- 
formance of a nontransferring control 
group at comparable posttransfer 
stages. Similarly, it is not clear to 
what extent the transfer effects found 
can be ascribed directly to familiarity 
with the class of material as apart 
from familiarity with the task or 
experimental situation. It seemed 
possible to answer such questions by 
a suitable manipulation of the pretraining
conditions of separate groups
of Ss and comparing the effects of
such groups on transfer to a common
situation.

¹ This research was conducted in the Laboratory
of Aviation Psychology of The Ohio State
University through support from a contract
between the Office of Naval Research and the
Ohio State University Research Foundation
(Contract No. Nonr-495(02)). The authors
wish to record their indebtedness to Drs. G. E.
Briggs, P. M. Fitts, and D. D. Wickens for
encouragement and assistance.

² Visiting Research Associate from the
Medical Research Council, Applied Psychology
Research Unit, Cambridge, England.

Second, it seemed feasible to utilize 
the information so obtained for the 
purpose of making inferences about 
the relative importance of various 
aspects of the total learning situation 
to the performance of the nontrans- 
ferring control group. The develop- 
ment of stimulus material and 
methods for the study of visual figure 
recognition described by Fitts et al. 
(2) provided an opportunity for 
investigating some of the problems 
raised. The complexity levels of 
their “metric figures” are defined by
the number of stimuli in a specifiable 
population. At the higher levels of 
complexity it is possible to generate 
very large numbers of figures, and it 
is convenient to make use of several 
samples of limited size for experi- 
mental purposes. Moreover, one has 
a priori grounds for assuming equal 
difficulty among samples drawn at 
random from the same population 
provided the samples are of adequate 
size. 

Considering a group of Ss operating 
with a sample of metric figures on a 
standard identification task one can 
attribute the level of performance at 
any given stage during training to 
learning of the properties of the class 
of stimuli, the population of stimuli, 
the specific figures making up the 
sample, and the task required. In 
the experiment to be described, the 
pretraining conditions of four groups 
were manipulated so as to include all 
or some of these components of the
common posttransfer condition: an
identification task with samples of 
metric figures. 


METHOD


Stimuli.—Two kinds of stimuli were used in 
this experiment. Properties of the metric 
figures have been described elsewhere in detail 
(2, p. 2). They are black on white, histogram-like
figures surrounded by a .4-in. white border
and mounted in 1.6 × 1.6-in. plastic covers.
The present figures were ones in which any one
of eight column heights could appear in any one
of eight adjacent positions, i.e., the 8 × 8
random population as specified. Three samples 
of eight figures each were drawn from the total 
possible population of almost seventeen million 
figures. The other class of stimuli was easily 
distinguishable, black-on-white silhouettes of 
common objects (ship, cup, dagger, etc.) 
mounted on the same sized covers as those used 
for the metric figures. 
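
The size of the 8 × 8 population follows directly from the description: eight possible heights in each of eight positions gives 8^8 = 16,777,216 figures, i.e., almost seventeen million. A minimal sketch of that count and of drawing three samples of eight figures is given below. The coding of heights as the integers 1 to 8, the function names, and the rule that no figure is drawn twice are illustrative assumptions; the actual sampling procedure is the one described by Fitts et al. (2), not this code.

```python
import random

N_HEIGHTS = 8   # possible column heights (assumed coded 1..8)
N_COLUMNS = 8   # adjacent column positions


def population_size(n_heights=N_HEIGHTS, n_columns=N_COLUMNS):
    """Number of distinct figures when every column may take any height."""
    return n_heights ** n_columns


def draw_samples(n_samples=3, figures_per_sample=8, seed=0):
    """Draw non-overlapping random samples of figures (hypothetical procedure)."""
    rng = random.Random(seed)
    seen = set()
    samples = []
    for _ in range(n_samples):
        sample = []
        while len(sample) < figures_per_sample:
            figure = tuple(rng.randint(1, N_HEIGHTS) for _ in range(N_COLUMNS))
            if figure not in seen:      # skip the rare duplicate draw
                seen.add(figure)
                sample.append(figure)
        samples.append(sample)
    return samples


if __name__ == "__main__":
    print(population_size())            # size of the 8 x 8 population
    for sample in draw_samples():
        print(sample[0])                # first figure of each sample
```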

Tasks.—Two tasks were used. The first was 
the standard identification task described in 
Fitts et al. (2, p. 6). In this task Ss are presented
with a board on which a given sample of
figures is replicated in six horizontal rows. The 
second was a classification task in which Ss are 
presented with a pile of figures consisting of 
several copies of each figure making up the 
sample. One copy of each figure is laid out in 
front of S who has to pick up one figure at a time 
from the pile and place it in a stack below the 
appropriate test figure. 

Experimental design.—This is shown in detail 
in Table 1. The four main groups differed 
among themselves in the conditions of pre- 
training but had in common the posttransfer 
condition. Each group of 45 Ss was made up 
of three subgroups of equal size to permit the 
use of three samples of metric figures when 
required. Group I, acting as a control, had 
pretraining conditions identical with those 
obtaining during posttransfer. The Ss in each 
of the three subgroups of Group II kept the 
identification task (I Task) constant, but
transferred from one sample of metric figures to 
another. On transfer, therefore, the properties 
of specific figures were new to them. Group 
III kept samples of metric figures constant 
throughout, but were pretrained on the classi- 
fication task (C Task). On transfer they had 
to learn the use of their figures in the standard 
identification task. Group IV again kept the 
standard identification task constant, but were 
pretrained on a sample of very easily distin- 
guishable figures. They were as unfamiliar 
with metric figures when transferring to them
as the members of all other groups at their first
pretraining trial.

TABLE 1

EXPERIMENTAL DESIGN
(15 Ss in each subgroup)

[Table body illegible in the scan. Columns: Group, Subgroup, Pretraining Trials, Posttransfer Trials. Surviving cell entries: "Metric A" and "Pictures, I Task".]

An attempt was made to match, in terms of 
means and variances, those pairs of subgroups 
of Groups I and II which performed on identical 
samples during pretraining. Groups I and II 
were run first, followed by Groups III and IV. 

Subjects.—The 180 Ss were male and female 
volunteers from introductory psychology courses 
at The Ohio State University. 

Procedure.—For the identification task this 
was as follows: the six replications of a given 
sample of figures were arranged on the sorting 
board, one to each horizontal row, in such a 
manner that the same figure would never appear 
above or below itself. From a seventh repli- 
cation E selected one of the figures, placed it in 
the center above the top row of figures, and 
allowed S to inspect this figure for approximately 
2 sec. before saying “Go” and starting the timing 
device. The S immediately began to look for 
the appropriate figure in the first row, turned it 
over when found, and proceeded in this manner 
through the sixth row. Having found the 
correct figure in that row, S had to press a 
button to stop the timing device. The S was 
allowed to use only one hand and had to find 
the correct figure in each row before being 
allowed to proceed. An easily distinguishable 
color code on the back of each figure provided 
the necessary knowledge of results. The E
recorded the time taken for this item, i.e., to
find the six copies of the test figure on the board,
while S replaced the turned-over figures. The
E then selected another figure and repeated the
above process until all the eight figures had been
dealt with once, which was held to constitute
one trial. The E then told S the time taken for
this trial, i.e., the cumulative total for eight
items. This time was accrued on a counter 
pulsed at the rate of 10 per sec., the clicks being 
clearly audible to S from the moment E said 
“Go” until S pressed the button. The figures 
in each row were then picked up and reshuffled 
before the whole process was repeated. 
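
The board layout described above can be sketched in code. The sketch assumes one reading of the constraint, namely that a figure never occupies the same column in two vertically adjacent rows; the text does not say whether non-adjacent rows were also constrained, nor how E actually produced the arrangement. Figures are represented by the integers 0 to 7, and the function name and the redraw-until-valid method are hypothetical.

```python
import random


def make_board(n_figures=8, n_rows=6, seed=None):
    """Return n_rows shuffled rows of figure indices.

    Assumed constraint: no figure sits directly above or below a copy of
    itself, i.e., adjacent rows never share a figure in the same column.
    """
    rng = random.Random(seed)
    board = []
    while len(board) < n_rows:
        row = list(range(n_figures))
        rng.shuffle(row)
        # Accept the row only if it differs from the previous row in every column.
        if not board or all(a != b for a, b in zip(board[-1], row)):
            board.append(row)
    return board


if __name__ == "__main__":
    for row in make_board(seed=1):
        print(row)
```

Redrawing a whole row until it fits is the simplest approach; roughly one random row in three satisfies the adjacent-row condition, so the loop ends quickly for a six-row board.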

For the classification task the procedure 
differed primarily in that S was required to 
deal with the whole sample of figures without 
any interruption. He was again given the time 
per trial from the counter at the end of each 
trial.

The S stood in front of the sorting board with 
E standing beside him. Under all conditions, 
regardless of stimuli or task used, Ss had to make 
48 responses per trial. The intervals between 
trials were of approximately 2 min. duration 
except for the one immediately preceding the 
transfer stage which lasted for 3 min. and during 
which Ss left the experimental booth.
Instructions stressed both speed and accuracy.

Scoring.—The time per item (cf. Procedure
above) was recorded to the nearest .01 sec. from
a Standard Electric timer. Summed over the
eight items constituting a trial this provided a
time per trial. These values were divided by
48 (6 rows × 8 figures per sample) to give a
mean time per figure per row per trial for each 
S. The results are presented in these terms in 
order to maintain uniformity with other publi- 
cations in the series of studies on visual figure 
recognition at Ohio State University. 
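
The scoring rule can be illustrated with a short sketch: the eight item times of a trial are summed and the total is divided by 48. The item times used in the example are invented for illustration and are not data from the study; the function name is likewise hypothetical.

```python
def mean_time_per_figure(item_times, n_rows=6):
    """Mean time per figure per row for one trial.

    item_times: the eight per-item times (sec.) recorded in one trial.
    The trial total is divided by rows x figures (6 x 8 = 48).
    """
    if len(item_times) != 8:
        raise ValueError("a trial consists of eight items")
    trial_time = sum(item_times)                    # time per trial
    return trial_time / (n_rows * len(item_times))


if __name__ == "__main__":
    # Invented example: every item takes 12.0 sec., so the trial takes 96.0 sec.
    print(mean_time_per_figure([12.0] * 8))
```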


RESULTS 


The results are presented in Table 
2 in terms of mean times per figure 
for the six trials. For the pretraining 
stage the data have been combined 
for the three subgroups within each 
main group, these data serving mainly 
to indicate the level at which pre- 
training took place. In evaluating 
the performance of Group III it 
should be recalled that this was the 
only group which did not perform the 
standard identification task during 
pretraining, but used the classification 
task. Although Ss in both tasks
were required to make the same
number of responses per task, the
nature of those responses was different,
so that one cannot make a
strict comparison between the two
tasks. The case is entirely different,
however, for Group IV, which performed
the same task as Groups I
and II but used very different
stimulus material, chosen for its great
simplicity. The difference between
Groups I and II on the one hand and
Group IV on the other is a direct
measure of the relative difficulty of
the stimulus materials used.

TABLE 2
MEAN TIMES PER FIGURE IN SECONDS

[Table body illegible in the scan. Columns: Group, Pretraining Trials 1-3, Posttransfer Trials 4-6.]

Note.—For Trials 1-3 the means are for 45 Ss each;
for Trials 4-6, 15 Ss each.

The results of an analysis of vari- 
ance for the posttransfer data using 
a partially hierarchical model (5, p. 
102) are summarized in Table 3. The 
attempted matching of pairs of sub- 
groups belonging to Groups I and II 
was not taken into account, in view 
of the large number of degrees of 
freedom involved in the total analysis. 
The results indicate that there were 
significant effects due to differences 
in pretraining and to continued prac- 
tice, and that there was significant 
interaction between these two effects. 
Any differences in performance due 
to samples of metric figures were 
shown to be nonsignificant. 





TABLE 3
ANALYSIS OF VARIANCE FOR POSTTRANSFER DATA

Source              MS          F

Groups (G)         5.3435
Samples (Sa)       1.4446
G × Sa             1.0168
Ss/G/Sa             .7321

Trials (T)        12.1453    148.11**
G × T               .4927      6.01**
Sa × T              .0344
G × Sa × T          .0127
T × Ss/G/Sa         .0820

[The df column and the remaining F entries are illegible in the scan.]






Tests of significance among means for the 
four main groups at each of the three trials were 
carried out using Duncan’s method (1). In 
view of the large number of degrees of freedom 
it was decided to test for differences only at the 
1% level. The appropriate error term for the 
test was T × Ss/G/Sa. Groups II, III, and
IV all differed significantly from Group I for all 
three trials, and Group IV differed from II and 
III only on Trial 4, i.e., the first posttransfer 
trial. It was also shown that the significance 
of differences due to continued practice was a 
function of the pretraining conditions of the 
main groups; these differences were significant 
throughout for Groups III and IV only. For
Group II there were significant differences
between Trials 4 and 5, and 4 and 6; for Group I
only between Trials 4 and 6.

[Figure 1. Ordinate: time per figure in seconds. Abscissa: Trials 1-6, divided into pretraining and posttransfer. Legend: Group I, Group II, Group III, Group IV, Group I* (* pretraining curve transposed).]

Fig. 1. Pretraining and posttransfer learning
curve for Group I, posttransfer curves only for
the other groups. The pretraining curve for
Group I has also been transposed to the
posttransfer position for purposes of comparison.


In Fig. 1 learning curves are pre- 
sented for the pretraining trials of 
Group I and the posttransfer trials 
of all four groups. To enable a com- 
parison of all the variations in pre- 
training used, the data of Group I 
for Trials 1-3 have been transposed 
to Trials 4-6, indicating the effect of 
no pretraining at all. 

For present purposes the most 
meaningful measure of transfer is the 
one which relates performance of a 
given experimental group to both the 
initial performance of the control 
group as well as the control group’s 
performance at the same posttransfer 
trial. In Gagné’s terminology this is 
really an improvement score and is 
calculated from the formula

$$\frac{C_1 - E_x}{C_1 - C_x} \times 100$$

where C stands for control group, E
for experimental group, subscript 1
for trial 1, and subscript x for a given
posttransfer trial (3, p. 102, formula 
4). This measure takes the control 
group’s improvement between Trials 
1 and x as 100% and expresses the 
experimental group’s performance on 
Trial x in relation to this. In Table 
4 these values are given for Groups
II, III, and IV for Trials 4 and 6.
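
A short sketch of the improvement score may help. It takes the control group's gain between Trial 1 and Trial x as 100% and expresses the experimental group's saving on Trial x as a percentage of that gain. The mean times in the example are invented for illustration and are not the values of Table 2; the function name is hypothetical.

```python
def improvement_score(c1, cx, ex):
    """Percentage transfer relative to the control group's own improvement.

    c1: control-group mean on Trial 1
    cx: control-group mean on posttransfer Trial x
    ex: experimental-group mean on posttransfer Trial x
    """
    control_gain = c1 - cx
    if control_gain == 0:
        raise ValueError("control group shows no improvement; score undefined")
    return 100.0 * (c1 - ex) / control_gain


if __name__ == "__main__":
    # Invented means (sec. per figure): control 3.00 -> 2.00, experimental 2.40.
    print(round(improvement_score(3.00, 2.00, 2.40), 2))
```

A score of 100 means the experimental group performs as well as the control group on Trial x; a score of 0 means it performs no better than the control group did on Trial 1.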


TABLE 4

TRANSFER PERCENTAGES FOR GROUPS II, III,
AND IV FOR TRIALS 4 AND 6

Group     Trial 4     Trial 6
II        59.45%      [illegible]
III       48.64%      [illegible]
IV        34.23%      79.85%


Discussion


Considering Groups I and II by themselves
it has been shown that for the
present type of material, transfer among
different samples of the same class of 
material resulted in about 60% transfer 
on the first posttransfer trial, increasing 
to about 87% on the third posttransfer 
trial. Both groups continued to improve 
performance between Trials 4 and 6, but
clearly Group II had the greater improvement,
so that amount of transfer
increased between Trials 4 and 6. A comparison
of the data of Groups I and IV
permits amplification of the statement 
concerning the efficiency of transfer 
among samples of material. It seems 
fair to say that Group IV operated 
under conditions most nearly similar to 
those used in more conventional transfer 
experiments, i.e., practicing the motor 
aspect of a task and transferring to a 
very different type of material. On 
transfer, Ss in this group had to learn 
from scratch about the properties of our 
metric figures. Yet, on the second and
third posttransfer trials there was no
significant difference between the per- 
formance of this group and the other 
two experimental groups. By the third 
posttransfer trial the amount of transfer 
was approximately 80%, or only 7% less 
than the group having most transfer. 
It is felt that the comparisons between 
the posttransfer performance of Groups 
II and IV serve to place the effects of 
transfer among samples of the same class 
of material in their proper perspective. 
They show that although initially the 
differences between effects of this type 
of transfer and a more conventional one 
may be considerable, after little addi- 
tional training the benefit of one need 
not be any greater than that of the other. 
It is worth noting at this time that all 
three experimental groups were still 
significantly different from the control 
group at Trial 6. 

Lastly, it is now possible to assess the 
relative importance of different aspects 
of the total learning situation studied. 
How much, in other words, is the im- 
provement of Group I between Trials 1 
and 4 due to learning that is specific to 
the actual figures used, to the class of 
figures, and to the task, both in the 
perceptual and in the motor sense? 


Inasmuch as all three experimental 
groups were better on Trial 4 than the 
control group on Trial 1, but worse than 
the control group on Trial 4, the decre- 
ments and the savings may be used to 
estimate the relative importance of 
various components in learning. This 
amounts to an analytical sampling of the 
state of learning of the control group at a 
given trial and is not necessarily an 
examination of possibly concurrent learn- 
ing processes. 

The Ss in the control group, in order 
to perform 1.11 sec. per figure better on 
Trial 4 than on Trial 1, must have learned 
motor and perceptual aspects of the 
situation, both of which in turn may be 
considered to have specific and general 
characteristics. Groups II and IV pro- 
vided one line of evidence. The former 
were .66 sec. better on Trial 4 than the 
control on Trial 1, presumably by virtue 
of familiarity with the class of material 
plus both the perceptual and motor 
aspects of the identification task. The 
Ss in Group IV were only .38 sec. better 
on Trial 4 than the control on Trial 1, 
presumably because all they had learned 
were the motor and perceptual aspects 
of the identification task. This leaves 
.28 sec. of the differences between control 
on Trial 1 and Group II on Trial 4 to be 
accounted for, but it is clear that part 
of the “saving” obtained by Group II
can be attributed to learning of the task 
rather than of the class of material. 
What makes Group II worse on Trial 4 
than the control group at that trial is 
lack of familiarity with the specific 
figures. This familiarity is, however, 
possessed by Group III, which in turn 
lacks nearly all the knowledge about the 
specific aspects of the identification task 
possessed by Group II. It is most 
interesting to find that in the statistical 
sense these two groups were not signifi- 
cantly different from each other through- 
out the transfer stage. It may be noted 
that Groups II and III complement each 
other in the sense that each has knowl- 
edge about two aspects of the situation 
which may in turn be called either 
specific or general, depending on whether 
one emphasizes the task or the material. 
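
The savings quoted above can be expressed as fractions of the control group's own improvement. The short sketch below does only that arithmetic; treating each saving as a percentage of the 1.11-sec. control gain is an assumption made here for illustration, not a formula stated in this section, although the results fall close to the first-trial transfer percentages reported in the Summary.

```python
# Figures quoted in the text, in sec. per figure.
control_gain = 1.11      # control group, Trial 1 to Trial 4
saving_group_ii = 0.66   # Group II on Trial 4 vs. control on Trial 1
saving_group_iv = 0.38   # Group IV on Trial 4 vs. control on Trial 1


def percent_of_control_gain(saving, gain=control_gain):
    """Express a saving as a percentage of the control group's improvement.

    Assumption: this ratio is only an illustrative index of transfer.
    """
    return 100.0 * saving / gain


# Part of Group II's saving not matched by Group IV (the .28 sec. in the text).
material_component = saving_group_ii - saving_group_iv

print(f"Group II: {percent_of_control_gain(saving_group_ii):.1f}% of control gain")
print(f"Group IV: {percent_of_control_gain(saving_group_iv):.1f}% of control gain")
print(f"Left to be accounted for: {material_component:.2f} sec.")
```

On this reading, roughly three-fifths of the control group's gain is matched by Group II and roughly one-third by Group IV.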







Meyer (7), in discussing the results of 
his learning-set studies, has suggested 
the terms “interproblem” and “intra- 
problem” learning. Although at first 
glance these terms are more operational 
than the terms specific and general, they 
do depend on an arbitrary definition of 
“problem.” In the present task it seems 
reasonable to think in terms of two 
“problems”: that which is largely per- 
ceptual and hinges around the ability to 
discriminate patterns, and that which is 
more predominantly motor, including 
both hand and eye movements, i.e., the 
ability to indicate the discrimination. 
For both these there can be clearly inter- 
problem and intraproblem learning and 
for both there may well be the con- 
tinuity of the process suggested by 
Meyer. The present data do not lend 
themselves to the same kind of quantifi- 
cation as that suggested by him, since 
the study involved a rather narrow scale 
in time and space, i.e., a few samples 
and training Ss for a few trials only. 
However, the material developed for the
study of visual figure discrimination
lends itself to extensive studies on human
Ss comparable to those carried out with
monkeys and prelanguage children by
Harlow, Meyer, and others. It is more
comparable to the material used in 
animal discrimination and reversal learn- 
ing than were the nonsense syllables used 
in human learning-set studies by Meyer 
and Miles (8). They noted that Ss 
learning successively lists of nonsense 
syllables did not show as much benefit 
from progressive transfer as might have 
been expected on the basis of animal 
learning-set studies. It is possible that 
this was due to the increasing proba- 
bility of intrusions from earlier lists 
having a detrimental effect on the 
learning of subsequent ones. In prob- 
lem solving or identification tasks, where 
it may not be necessary to name a 
relation or an object, the probability of 
such intrusions having a detrimental 
effect is much less. 


SUMMARY 


Four groups of 45 Ss each were used in an
experiment to examine the effects of transfer
between samples of the same class of material.
It was shown that: 

1. The Ss transferring between different 
samples of the same class of material (Group II) 
showed transfer effects which increased from 
60% to 87% from the first to the third post-
transfer trial when compared with a nontrans-
ferring control group (Group I).

2. Another group (IV) pretrained on the
same task but using material of a different kind 
from that of the control group showed improve- 
ments in transfer from 34% to 80% from the 
first to the third transfer trial. 

3. The performance of Groups II and IV 
differed significantly only on the first post- 
transfer trial. 

4. Part of the transfer effect shown by Group 
II could be accounted for by familiarity with the 
task rather than the class of material used. 

5. Pretraining on an identical sample of 
figures but on a different task (Group III) 
produced about the same transfer effects as 
pretraining on the same task but a different 
sample of figures (Group II). 


REFERENCES 


1. Duncan, D. B. A significance test for dif-
ferences between ranked treatments in
an analysis of variance. Va. J. Sci.,
1951, 2, 171-189.

2. Fitts, P. M., Weinstein, M., Rappaport,
M., Anderson, N., & Leonard, J. A.
Stimulus correlates of visual pattern
recognition. J. exp. Psychol., 1956, 51,
1-11.

3. Gagné, R. M., Foster, H., & Crowley,
M. E. The measurement of transfer of
training. Psychol. Bull., 1948, 45, 97-
130.

4. Harlow, H. F. The formation of learning
sets. Psychol. Rev., 1949, 56, 51-65.

5. Harter, H. L., & Lum, M. D. Partially
hierarchical models in the analysis of
variance. USAF Wright Air Develop-
ment Center, Tech. Rep. WADC TR
55-33, 1955.

6. McGeoch, J. A., & Irion, A. L. The psy-
chology of human learning. Longmans,
Green, 1952.

7. Meyer, D. R. Food deprivation and
discrimination reversal learning by mon-
keys. J. exp. Psychol., 1951, 41, 10-
16.

8. Meyer, D. R., & Miles, R. C. Intralist-
interlist relations in verbal learning. J.
exp. Psychol., 1953, 45, 109-115.

(Received April 30, 1956) 





Journal of Experimental Psychology
Vol. 53, 1957


APPLICATION OF THE HALF-SPLIT TECHNIQUE TO
PROBLEM-SOLVING TASKS ¹ ²


ROBERT A. GOLDBECK,³ BENJAMIN B. BERNSTEIN,⁴
W. A. HILLIX, AND MELVIN H. MARX


University of Missouri 


The need to maintain complex 
electronic equipment has stimulated 
interest in a class of problem solving
called trouble shooting. Faulty com- 
ponents in electronic systems must be 
located on the basis of the perform- 
ance of the system and its parts. 
Solution of a problem may be con- 
sidered to have two parts: the deter- 
mination of a set of possible faulty 
components, and the elimination of 
all but the component that is actually 
faulty through some checking pro- 
cedure. 

Miller, Folley, and Smith (3) have 
presented a logical demonstration of 
the superior efficiency of the half-split 
technique for problems so considered. 
Each check eliminates half of the 
alternatives. However, there has 
been no empirical examination of this 
solution method, the effectiveness of
which will depend upon human capa-
bility to use it.

¹ This research was supported in part by the
United States Air Force under Contract No.
AF 18(600)-1053 monitored by the Maintenance
Laboratory, Air Force Personnel and Training
Research Center, Lowry Air Force Base, Denver,
Colorado. Permission is granted for repro-
duction, translation, publication, use, and dis-
posal in whole or in part by or for the United
States Government.

² Experiment I represents the major part of a
doctoral dissertation by the first-named author
at the University of Missouri. The second-
named author was primarily responsible for
Exp. II. The third-named author is primarily
responsible for the form of the present report.
The fourth-named author served as principal
investigator and supervisor of the research.

³ Now at American Institute for Research,
Pittsburgh, Pennsylvania.

⁴ Now at Bombing-Navigation Research Unit,
Operator Laboratory Field Unit No. 4, Air Force
Personnel and Training Research Center, Mather
Air Force Base, California.

First, a set of trouble alternatives 
(possible faulty components) must 
be determined. Then check points 
which divide the set of alternatives 
into halves must be located. If the 
trouble alternatives are directly given 
and have a simple structure which can 
be easily divided into halves, empirical 
results should follow closely the 
mathematics of information theory. 
But if either of these two aspects of 
the method is complicated by com- 
plexities in the structure of the 
system, method effectiveness should 
be more intimately related to the 
capabilities of Ss. 
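
The appeal to information theory can be made concrete with a small calculation. The sketch below assumes that all trouble alternatives are equally likely and that each check has only two outcomes (signal present or absent); the function names are illustrative and do not come from the paper.

```python
import math


def bits_to_identify(n_alternatives):
    """Information (bits) needed to single out one of n equally likely alternatives."""
    return math.log2(n_alternatives)


def check_information(n_alternatives, n_upstream):
    """Expected information (bits) from one two-outcome check.

    The check separates `n_upstream` alternatives from the remaining ones;
    all alternatives are assumed equally likely.
    """
    p = n_upstream / n_alternatives
    if p in (0.0, 1.0):
        return 0.0          # the outcome is certain, so nothing is learned
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


n = 10  # one signal chain of 10 units
print(f"bits needed for {n} alternatives: {bits_to_identify(n):.2f}")
for k in (1, 3, 5):
    print(f"check isolating {k} of {n}: {check_information(n, k):.2f} bits")
```

Under these assumptions a check that splits the set into equal halves yields the most information any two-outcome check can give, which is the logical basis of the half-split method.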

The two experiments reported here 
were designed to provide empirical 
tests of application of the half-split 
method. The major purpose of our 
first experiment was to examine the 
effects of problem complexity upon 
the efficiency of solution for Ss in- 
structed in the half-split method and 
for Ss not so instructed. Specifically, 
the experiment was designed (a) to 
test the hypothesis that half-split 
instruction increased efficiency for 
simple problems, and (b) to determine 
half-split efficiency for more complex 
problems. As implied in the pre- 
ceding paragraph, it was expected 
that the half-split instructions would 
improve efficiency for the simpler 
problems, but not necessarily for the 
more complex. 

The second experiment was de-
signed to determine whether addi-
tional practice, different methods of
instruction, and more complex train-
ing materials would produce an im-
provement in S’s ability to arrive at
a set of alternatives, and, if so,
whether the half-split method would
then be more efficient. Four different
training methods were tested for
effectiveness.


EXPERIMENT I
Method 


Apparatus.—The problems were presented to
S by means of an apparatus designed to simulate 
an electronic system, and a diagram of the 
system relationships. The system consisted of 
10 display items and 100 connected components. 
The apparatus is described elsewhere (2) in more 
detail. 

The diagram of the simplest system is shown 
in Fig. 1. Each of the 10 display items termi- 
nated a signal chain consisting of 10 units con- 
nected in series. Each signal chain was con- 
nected to one of the other signal chains at a 
random point in the chain. Thus each simple 
chain had one remote connection ; this was either 
an input to the chain or an output from it. This
system diagram, used in Exp. I at Complexity I,
formed the skeleton for the other two diagrams.
For the Complexity II diagram, an additional
remote connection was made for each of the
chains. For Complexity III, a third connection
was made. As complexity increased, the signal
network feeding the display items became more
complicated and a faulty unit caused more
symptoms (display items that were off) in the
symptom array.

FIG. 1. Complexity I diagram.
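
The relation between a faulty unit, the symptom array, and the set of units that could have produced that array can be illustrated with a small simulation. This is a sketch under stated assumptions, not the authors' apparatus: the wiring in `make_links` and in the example is hypothetical, a unit is assumed to need every one of its inputs, and all function names are invented here.

```python
import random

N_CHAINS, CHAIN_LEN = 10, 10   # 10 display items, 10 units per signal chain


def make_links(seed=0):
    """Pair the chains at random; each pair gets one remote connection.

    A link ((a, j), (b, k)) means the output of unit j in chain a is also
    an input to unit k in chain b (hypothetical wiring, not the paper's).
    """
    rng = random.Random(seed)
    chains = list(range(N_CHAINS))
    rng.shuffle(chains)
    return [((a, rng.randrange(CHAIN_LEN)), (b, rng.randrange(CHAIN_LEN)))
            for a, b in zip(chains[0::2], chains[1::2])]


def symptom_array(faulty, links):
    """Return the display items (chain numbers) turned off by one faulty unit."""
    chain, pos = faulty
    lost_from = {chain: pos}          # chain -> first unit with no output signal
    changed = True
    while changed:                    # propagate signal loss across remote links
        changed = False
        for (a, j), (b, k) in links:
            if a in lost_from and lost_from[a] <= j and lost_from.get(b, CHAIN_LEN) > k:
                lost_from[b] = k
                changed = True
    return frozenset(lost_from)       # a chain that lost signal has its display off


def trouble_possible(observed, links):
    """Units whose failure alone would reproduce the observed symptom array."""
    return [(c, i) for c in range(N_CHAINS) for i in range(CHAIN_LEN)
            if symptom_array((c, i), links) == observed]


links = [((0, 4), (1, 6))]            # one illustrative remote connection
symptoms = symptom_array((0, 2), links)
print(sorted(symptoms), trouble_possible(symptoms, links))
```

With this single connection, a fault at the third unit of chain 0 turns off two display items, and only the five units of chain 0 at or before the connection point could have produced that pattern.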

The display items were presented as nine 
lights and a meter on a panel. When operating 
properly, the components transmitted a signal 
to the display items connected to them, causing 
the nine lights to be on and the meter to give a 
specified reading. 

A second panel consisted of switches and an 
associated light. By throwing the switch repre- 
senting the check point of any one of the 100 
components under the display items, S could 
find out whether or not the signal was present 
at the output of the component. If it was, the 
light would come on; if not, the light would 
remain off. 

There was a third panel which also had a 
switch for each of the 100 components. When 
S thought he knew the faulty component, he 
could throw the associated switch on this third 
panel, thus effectively replacing this component.
If that component was really the faulty one, the
items on the display panel that had been off 
would now come back on, thus showing S that 
he had solved the problem. 

The Ss were given a problem-solving test 
(PST) designed to measure ability on those 
aspects of the trouble shooting that concerned 
the deduction of the set of trouble alternatives. 
It consisted of 20 questions requiring S to count 
units that had specified relationships to other 
units in the network printed on a diagram. The
64 units were identified by pairs of letters. 

Subjects.—The Ss were 90 male university 
students drawn from a general experimental 
psychology class. Thirty Ss were assigned to
each of the three complexity levels; 15 were given 
each method of instruction. 

Procedure.—On Day 1, S was given a diagram 
for the complexity level to be used during testing, 
but with remote connection inputs and outputs 
at different points in the simple signal chains, in 
order to prevent specific transfer effects. The
S listened to recorded instructions covering the 
general functioning of the system. He was told 
that a trouble-shooting problem would be pre- 
sented as a pattern of on-and-off display items 
called the symptom array. A faulty component 
would cause whichever display items were con- 
nected to it to be off. By tracing the relevant 
connections on the system diagram, S could 
determine which units might cause the given 
pattern of symptom items to be off. He could 
also determine the most efficient place to check. 
He could check points in the system by means of 
switches on the checking panel. Finally, the 
checking information would enable him to locate 
the trouble source. The problem was solved 
when he threw the replacement switch for the 
faulty component, and the display items came 
back on. After hearing the instructions, S was 
allowed to ask questions and then required to 
answer a list of questions to check his compre- 
hension of the task. These questions required 
S to perform and interpret correctly specified 
checks and replacements.

On Day 2, Ss were given a second set of 
check-out questions to insure retention of the 
instructions. Then Ss in the half-split (HS) 
groups listened to a 5.5-min. recording of in- 
structions on the half-split method and were 
asked if they had any questions about the 
method. Check-out questions followed which 
required S to correctly apply the half-split 
method, step by step, to two simple problems. 
The S was to examine the symptom array and 
the associated interconnected units. For any 
given array, there were several units whose 
malfunctioning could produce that array. 
These were called trouble-possible units. The 
S was to locate these and successively halve them 


by making the appropriate checks until the 
faulty unit was isolated and replaced. 
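
The checking sequence just described can be sketched for the simplest case, a single serial chain in which every unit is a trouble-possible unit. The code assumes that a check reports only whether the signal is present at a unit's output; the function names and the sequential comparison strategy are illustrative additions, not procedures taken from the paper.

```python
def half_split_checks(n_units, faulty):
    """Count the checks needed to isolate `faulty` (0-based) by half-splitting.

    A check at unit i finds the signal present only if the fault lies
    downstream of unit i.
    """
    lo, hi = 0, n_units - 1            # the fault is known to lie in units lo..hi
    checks = 0
    while lo < hi:
        mid = (lo + hi) // 2
        checks += 1
        signal_present = faulty > mid  # simulated check at the output of unit mid
        if signal_present:
            lo = mid + 1               # fault is downstream of mid
        else:
            hi = mid                   # fault is at mid or upstream of it
    return checks


def sequential_checks(n_units, faulty):
    """Count the checks when outputs are checked one by one from the input end."""
    # Once every earlier unit is cleared, the last unit needs no check.
    return min(faulty + 1, n_units - 1)


n = 10
half = [half_split_checks(n, f) for f in range(n)]
seq = [sequential_checks(n, f) for f in range(n)]
print("half-split: mean", sum(half) / n, "max", max(half))
print("sequential: mean", sum(seq) / n, "max", max(seq))
```

For a 10-unit chain the half-split sequence never needs more than four checks, whereas checking from one end can need up to nine.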

The Ss in the control groups received no 
special method instructions, but were given 
those parts of the check-out questions that did 
not involve the HS method as such. All Ss 
were told that their task was to find the trouble 
in the least number of checks possible; 20 min. 
would be allowed for each problem, but otherwise 
time was not a consideration. A total of 12 Ss 
failed to finish a total of 17 problems within the 
20-min. time limit. Estimates of times and 
checks to solution were made for these incom- 
pleted problems, based upon the degree of 
solution and the average rate at which S pro- 
gressed toward solution. 

The first problem given each S at all three 
complexity levels was a trouble with only one 
symptom—the simplest type of problem. For 
the next five problems all Ss were given the same 
five troubles (i.e., the same five “faulty units” 
were used regardless of method or complexity 
level). The symptom array for a given trouble
varied with complexity level. Trouble order 
was counterbalanced according to a latin square 
arrangement with three replications in each cell 
of the factorial design. 

Following testing on the electronic system Ss 
were given the PST. 


Results 


Checking scores and times.—The 
over-all method difference was not 
significant (F < 1) for a simple analy- 
sis of variance using method, com- 
plexity, and within-Ss variance. The 
Method × Complexity interaction
was also not significant (F < 1). As
was expected, the complexity dif-
ference was significant (F = 3.89,
P < .05 for df = 2, 84); the number
of checks was positively related to 
complexity. 

Since a difference in favor of the 
HS group had been predicted at 
Complexity I, a separate analysis was 
carried out for Complexity I only. 
Here the method difference was sig- 
nificant (F = 4.68, P < .05 for 
df = 1, 24). 

The solution time for the HS groups
was significantly longer than for the
control (F = 7.44, P < .01 for
df = 1, 84). This reflects the addi-
tional covert solution behavior asso-
ciated with the attempt to reduce
checking responses by means of the
half-split procedure. With increasing
complexity the times increased
(F = 18.60, P < .001 for df = 2, 84),
and the obtained HS-control differ-
ence became more marked.

PST analysis—In order to assess 
the effects of levels of PST perform- 
ance in conjunction with the main 
variables, Ss were divided into three 
levels according to the upper, middle, 
and lower third of the PST score 
distribution for each group.⁵ An
analysis of variance was performed 
using these levels as an additional 
variable. The results agreed with 
those of the previous analysis and 
produced two additional findings. 
PST level had a highly significant 
effect (F = 9.20, P < .001 for df = 2, 
72), as did the PST Level × Com-
plexity interaction (F = 3.49, P < .05
for df = 4, 72). Increasing com- 
plexity resulted in increased check 
scores only for Ss scoring low on the 
PST. 

It would seem that the difficulty of 
complex problems was in large part 
dependent upon S_ characteristics 
which were measured by the PST. 

The product-moment correlations
between PST and checking scores at
the three complexity levels were as
follows: for the half-split group,
+.16, -.31, and -.80; for the
control group, +.24, -.63, and
-.80. The increase in correlation
with complexity indicates a greater
role for PST ability on complex
problems. High PST scores and low
checking scores represent good per-
formance, producing negative r’s.

⁵ The levels breakdown of PST scores by
groups, some analysis of variance tables men-
tioned in the text, the instructions for both
experiments, and certain tabulations of data
from both experiments, have been sent to the
American Documentation Institute. Order
Document No. 5145 from ADI Auxiliary
Publications Project, Photoduplication Service,
Library of Congress, Washington 25, D. C.,
remitting in advance $2.00 for microfilm or
$3.75 for photocopies. Make checks payable to
Chief, Photoduplication Service, Library of
Congress.

Since the PST was administered to 
Ss in Exp. I after they had performed 
the apparatus problems, it was pos- 
sible that differential transfer from 
the apparatus problems to the PST 
occurred for the different experimental 
groups. To test this possibility, an 
analysis of variance was carried out 
on the PST scores classified according 
to the complexity and method vari- 
ables used for the apparatus problems. 
There was no evidence for differential
transfer (F < 1 for M and C, 1.48
for M × C with df = 2, 84 for the
latter).

Reliability.—The reliability of the
total-checks scores for each of the
major experimental groups was de-
termined by an analysis of variance
technique developed by Hoyt (1).
For the HS groups the reliability
values were .62, .72, and .53 for
Complexities I, II, and III, respec-
tively. For the control groups the
reliability values were .50, .99, and
.74. These values indicate rea-
sonably high reliability for the ex-
perimental situation. The split-half
reliability using an odd-even break-
down on the PST was .83.
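
An analysis-of-variance reliability estimate of the kind attributed to Hoyt can be sketched as follows. The sketch assumes the usual form of that coefficient, (MS between Ss minus MS residual) divided by MS between Ss, computed from an Ss-by-problems table without replication. The scores are hypothetical, and the paper does not say that the computation was laid out this way.

```python
def hoyt_reliability(scores):
    """Analysis-of-variance reliability for a table of Ss (rows) by problems (columns).

    Assumed form: (MS between Ss - MS residual) / MS between Ss.
    """
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between Ss
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between problems
    ss_resid = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_resid = ss_resid / ((n - 1) * (k - 1))
    return (ms_rows - ms_resid) / ms_rows


# Hypothetical check counts: six Ss, five problems each.
scores = [
    [3, 4, 3, 5, 4],
    [6, 7, 5, 8, 7],
    [4, 4, 5, 4, 6],
    [9, 8, 10, 9, 11],
    [5, 6, 4, 7, 5],
    [7, 9, 8, 8, 10],
]
print(round(hoyt_reliability(scores), 3))
```

The coefficient approaches 1.0 when Ss differ consistently across problems and falls toward zero when the differences between Ss are no larger than the residual variation.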

Half-split performance measures. 
The Ss were unable to benefit from 
instruction in the logically superior 
HS method, at least at the higher 
complexities. It is then appropriate 
to examine the reasons for this failure 
and to determine which aspects of the 
method present difficulty. 

Since HS Ss were instructed first to 
determine the units which if faulty 
could cause the symptom array pre- 
sented (trouble-possible units), the 
number of times each S’s first check
was among the trouble-possible units
was scored. The HS groups showed 
a slight advantage in beginning their 
responses in the trouble-possible area, 
but the methods effect was not sig- 
nificant (F = 2.29 for df = 1, 72). 
It is surprising that Ss who have been 
told how to go about limiting the 
scope of trouble alternatives were not 
significantly better at this part of the 
task. A strong effect of PST levels 
(F = 5.66, P <.01 for df = 2, 72) 
indicates that there was an important 
ability factor involved in determining 
trouble-possible units. 

A significant complexity by PST 
level interaction (F = 2.50, P < .05 
for df = 4, 72) resulted from a steeper 
decline in trouble-possible first checks 
with increasing complexity for low 
PST-scoring Ss than for middle- or 
high-scoring Ss. However, for the 
high PST level the complexity dif- 
ference was still significant (F = 4.24, 
P <.05 for df = 2, 20). These re- 
sults, plus the significant complexity 
effect (F = 10.04, P< .001 for 
df = 2, 72), indicate that increasing 
complexity had the effect of pre- 
venting Ss from accomplishing the 
first step in the half-split method used 
in this experiment. At the higher 
complexities Ss were limited by an 
ability factor in arriving at the 
trouble-possible set of alternatives by 
an interpretation of the symptom 
array. 

The second part of the HS method 
concerned making half-split checks 
among the trouble-possible units. 
Scoring each S for the number of 
perfect half-split checking sequences 
used in the trouble-possible area 
resulted in the superiority of the 
HS groups. The method difference 
was highly significant (F = 30.65, 
P <.001 for df = 1, 72). Here there 
was no significant complexity differ- 
ence. However, with increasing com- 


plexity the number of trouble-possible 
units was generally smaller, requiring 
fewer steps in the half-split checking 
sequence. 

To eliminate this bias, only the 
first check in the trouble-possible area 
was considered. An analysis of vari- 
ance using this measure showed a 
significant method effect (F = 71.12,
P < .001 for df = 1, 72) and a
Complexity by Method interaction
(F = 4.57, P < .05 for df = 2, 72).
This resulted from a reduction in
half-split checks with increased com-
plexity by the HS group. Since the
trouble-possible units were contained 
in simple lines at all complexities 
(except for two troubles at Com- 
plexity II where two lines were in- 
volved), this reduction in half-split 
checks can be attributed to the 
difficulty in the first step of getting 
a set of alternatives. If the trouble 
alternatives cannot be recognized as 
such, they cannot be split in half. 

Correct solutions.—Although pri- 
mary emphasis has been placed upon 
the number of checks used to solve a 
problem, consideration of whether or 
not Ss’ first solution was a correct one 
is of some importance. When each 
S was scored for the number of correct 
first replacements for the five prob-
lems, only PST level showed a sig-
nificant effect (F = 5.99, P < .01 for
df = 2, 72). This indicates that this
aspect of performance was determined 
by the characteristics of S rather than 
by the situational variables. 

In addition, each S was scored for 
the average number of checks used 
for finding troubles where he made 
no incorrect replacements. There 
was a significant method effect 
(F = 23.47, P < .001 for df = 1, 72) 
and a significant Method × Com-
plexity interaction (F = 5.77,
P < .01 for df = 2, 72). The HS
group's advantage was greatest at
Complexity III. Where S avoided any
incorrect solutions, the half-split
method was more effective.


The results of Exp. I may be summed 
up as follows: (a) The increase in number 
of checks required with increased com- 
plexity indicates the importance of being 
able to deduce the trouble-possible units 
correctly, especially since the number 
of trouble-possible units decreased with 
increasing complexity. If Ss could have 
conceptualized system relationships per- 
fectly, fewer checks would have been 
required. (b) The HS method cannot in
general be said to be superior. (c) There 
is evidence that it is superior under 
certain conditions, such as with diagrams 
of low complexity, where Ss apparently 
had little difficulty in finding the trouble- 
possible units, and where Ss avoided 
incorrect replacements. 

This latter finding gave some hope that 
if all Ss could be given increased training 
on working with the system relation- 
ships, then the HS method would be 
superior even with more complex dia- 
grams. Experiment II was an attempt 
to improve the effectiveness of the HS 
method by improving instruction and 
giving extended practice. 


EXPERIMENT II 
Method 


Apparatus.—The same apparatus was used 
as in Exp. I, except that only the system diagram 
of greatest complexity (three remote connec- 
tions) was used during test trials. The training 
diagram was different, being the previous test 
diagram of intermediate complexity (two remote 
connections). 

Subjects.—The Ss were 40 male students from 
a general experimental psychology class at the 
University of Missouri. Ten Ss were randomly 
assigned to each of the four instructional groups. 

Procedure.—Before beginning either the in- 
structions or the experimental sessions in trouble- 
shooting, all Ss were given the PST for use in 
analysis of covariance. The basic instructional 
procedure was similar to that of Exp. I. How-
ever, all Ss had 2-hr. sessions on the first day,
rather than the 1-hr. session of Exp. I. All Ss
had additional practice with the diagram of 
intermediate complexity, with special conditions 
of practice for each group. 




Group C (control) was essentially like the 
control group of Exp. I, except for the additional 
practice. During this practice, E asked S to 
specify the check he wanted made. If it did not 
violate any basic instructions, S was allowed to 
make the check. 

Group DHS (Deductive Half-Split) was like 
the previous Group HS, with additional practice 
on the more complex diagrams. During the 
supervised practice for this group, S was allowed 
to make the check only if he had correctly 
deduced the trouble-possible items, and if the 
check correctly half-split those units. If it was 
an incorrect check, S was told only that it was 
incorrect because it was not a half-split check, 
whereupon S kept trying until the correct check 
was made. 

Group DA (Deductive Alternatives) differed 
from the groups previously used. The rationale
for setting up this new group is as follows: It is 
not necessary to arrive at the minimum set of 
trouble-possible units in order to find the trouble, 
so long as the set arrived at contains the faulty 
unit. It may be that S can do better where he 
is required only to find some set of alternatives 
that includes the trouble spot, if finding the 
minimum set would strain his capabilities, as 
happened in Exp. I. Thus, Group DA was 
instructed in three alternative ways of conceiving 
of the set of alternatives, and allowed to choose 
any one of them for half-split application during 
the test. The minimum set consisted only of 
the trouble-possible components, just as in 
Group DHS. The maximum set consisted of all
components in the lines beneath display items 
that were off. The intermediate set was ob- 
tained by eliminating those lines in the maximum 
set which did not have components feeding into 
other off-display items. For example, if there 
were three display items off, all the components 
beneath the off-display items (30 components) 
would constitute the maximum set. If one of 
the three lines involved did not have components 
feeding into either of the other two off-display 
items, it could not possibly contain the faulty 
component and thus could be eliminated from 
consideration for the intermediate set, which 
would then consist of 20 components. Further 
considerations might allow elimination of further 
components for the determination of the mini- 
mum set. During the practice session for this 
group, £ told S what kind of set to split and 
allowed only a half-split check of the designated 
set 
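
The maximum and intermediate sets taught to Group DA can be sketched in a few lines. The wiring below is hypothetical and was chosen only to reproduce the 30-component and 20-component example in the text; the rule that a line is kept when it feeds at least one other off-display line is one reading of the description, and the function names are illustrative.

```python
CHAIN_LEN = 10   # components in the line beneath each display item


def maximum_set(off_displays):
    """All components in the lines beneath display items that are off."""
    return {(c, i) for c in off_displays for i in range(CHAIN_LEN)}


def intermediate_set(off_displays, links):
    """Drop lines that feed no other off-display item.

    `links` holds pairs ((a, j), (b, k)): the output of unit j in line a
    feeds unit k in line b. With a single off display, nothing is dropped.
    """
    off = set(off_displays)
    feeders = {a for (a, _), (b, _) in links
               if a in off and b in off and a != b}
    kept = feeders if len(off) > 1 else off
    return {(c, i) for c in kept for i in range(CHAIN_LEN)}


# Hypothetical wiring: line 0 feeds line 1, and line 1 feeds line 2.
links = [((0, 3), (1, 5)), ((1, 7), (2, 2))]
off = {0, 1, 2}
print(len(maximum_set(off)), len(intermediate_set(off, links)))
```

Here line 2 feeds neither of the other off-display lines, so it is removed and the set shrinks from 30 to 20 components; reaching the minimum set would require further reasoning about where each connection enters and leaves a line.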


Group HS (Half-Split) received varied prac-
tice in half-splitting sets of components, but was
given no practice in determining the sets of
alternatives. The symptom array was not used
during practice for this group. The E told S
which units to split, and corrected him only if
checks did not correctly half-split the given set
of alternatives.


TABLE 1

GROUP MEANS FOR PERFORMANCE AND PST
MEASURES, EXP. II

Measure                              C            DHS    DA           HS
Mean checks to solution              57.1         22.3   [illegible]  [illegible]
Mean trouble-possible first checks   [illegible]  3.8    1.8          [illegible]
Mean PST scores                      11.4         14.1   12.4         [illegible]


Test procedure was like that in Exp. I. 
There were two trouble-by-trial latin square 
arrangements in each group. The Ss were 
allowed 25 min. to finish a problem. 


Results 


Checking scores and times.—Both 
the mean number of checks required 
to find the five troubles and the mean 
number of times S’s first check was 
among the trouble-possible units are 
shown in Table 1, along with the 
group means on the PST. The 
groups scored in the same order on 
all three measures, and the group 
difference on the PST was almost 
significant (F = 2.62, with 2.86 re- 
quired for P = .05, df =3, 36), 
despite the fact that Ss were randomly 
assigned. 

Therefore, an analysis of covariance 
was used to determine whether the 
groups differed due to treatments, in 
addition to the variation correlated 
with the PST. Since the means and
variances for number of checks to
solution were correlated, and the
variances were significantly hetero-
geneous (χ² = 10.99, P < .02), a log
transformation was used and suc-
cessfully corrected for the hetero-
geneity. The method difference for
the transformed scores was significant
(F = 6.75, P < .01 for df = 3, 35).
The adjusted log means for the groups
were as follows: C, 1.68; DHS, 1.39;
DA, 1.59; and HS, 1.66. Tests of
these means, using a generalized error
term of .069 derived from the analysis
of covariance, showed that Group
DHS was superior to each of the other
groups at the .01 level. None of the
other comparisons was significant.
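
To read the adjusted log means on the original scale of checks, they can be transformed back. The sketch below assumes common (base-10) logarithms, which the text does not specify; the back-transformed values are therefore only indicative and are not the raw group means.

```python
# Adjusted log means quoted in the text.
adjusted_log_means = {"C": 1.68, "DHS": 1.39, "DA": 1.59, "HS": 1.66}

# Assumption: the transformation was log10, so the inverse is 10 ** x.
back_transformed = {group: 10 ** value
                    for group, value in adjusted_log_means.items()}

for group, checks in back_transformed.items():
    print(f"{group}: about {checks:.1f} checks")

best = min(back_transformed, key=back_transformed.get)
print("fewest checks:", best)
```

On this assumption Group DHS needed about half as many checks as the other three groups, which lie close to one another.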

The analysis of covariance of the 
trouble-possible first check scores 
yielded a highly significant treatment 
difference (F = 13.22, P < .001 for 
df = 3, 36). The adjusted means for 
the four groups were: C, 1.44; DHS, 
3.50; DA, 1.71; and HS, .46. The
t tests showed the DHS group superior 
to all other groups at beyond the .001 
level. The HS group was inferior to 
all other groups at beyond the .05 
level. The only insignificant differ- 
ence was between Groups DA and C. 
This result indicates that the different 
instructions and conditions of practice 
differed in their effectiveness in help- 
ing S to accomplish the first step 
of the trouble-shooting process. The 
difference in number of checks to 
solution seems attributable to differ- 
ences in ability to deduce a set of 
trouble-possible units, rather than 
to differences in ability to half-split 
alternatives, once determined. The 
analysis of covariance on time to 
solution did not yield any significant 
results. 

Another subsidiary measure of effi- 
ciency was the number of times S 
“replaced” a unit on the replacement 
panel. Perfect efficiency would in- 
volve using only as many replace- 
ments as there were troubles, five 
in this case. The average numbers
were: C, 9.0; DHS, 8.0; DA, 10.1; 
and HS, 12.1. The analysis of co- 
variance of these scores did not 
approach significance. 

PST scores.—Product-moment cor-
relations between number of checks to
solution and PST scores for the four
groups were as follows: C, -.46;
DHS, -.59; DA, -.56; and HS,
-.59. The required r for signifi-
cance at the .05 level is .60 for the
small N of 10. The correlations were,
of course, negative since low check
scores indicate better performance
than high check scores.

Comparison with Exp. I.—Groups 
C and DHS from Exp. II were com- 
bined with the two groups from Exp. 
I in an analysis of covariance with 
PST scores as the predictor variable. 
This use seems justified despite the 
fact that PST was administered after 
the apparatus test in Exp. I, and 
before the apparatus test in Exp. II, 
since there was no evidence of dif- 
ferential transfer in Exp. I. 

The analysis showed a significant 
Study × Method interaction (F = 6.34,
P < .02 for df = 1, 45). Since the
variances were again heterogeneous
(χ² = 12.85, P < .01), a conservative
interpretation is necessary. Even so, 
this comparison tends to support the 
observation that the instructions and 
practice given Group DHS in Exp. II 
were more effective in reducing the 
number of checks than were the less 
extensive instructions in Exp. I. The
mean number of checks for Exp. I at 
the high complexity were C, 47.8; HS, 
45.3; for Exp. II, C, 57.1; DHS, 22.3. 
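
The pattern behind the Study × Method interaction can be made concrete with simple arithmetic on the reported means. The sketch below is illustrative only: it uses the four unadjusted means given above, the dictionary keys and function name are assumptions introduced here, and it does not reproduce the covariance adjustment for PST scores or the F test.

```python
# Mean number of checks to solution at high complexity, as reported in the text.
mean_checks = {
    ("Exp. I", "control"): 47.8,
    ("Exp. I", "half-split"): 45.3,    # Group HS
    ("Exp. II", "control"): 57.1,
    ("Exp. II", "half-split"): 22.3,   # Group DHS
}


def method_effect(study):
    """Control mean minus half-split mean (positive = fewer checks with half-split)."""
    return mean_checks[(study, "control")] - mean_checks[(study, "half-split")]


effect_exp1 = method_effect("Exp. I")
effect_exp2 = method_effect("Exp. II")

# The interaction is the difference between the two method effects.
interaction_contrast = effect_exp2 - effect_exp1

print(f"Exp. I effect:  {effect_exp1:.1f} checks")
print(f"Exp. II effect: {effect_exp2:.1f} checks")
print(f"Difference of differences: {interaction_contrast:.1f} checks")
```

The method effect is small in Exp. I and large in Exp. II, which is the form of interaction the analysis reports.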

The analysis of covariance of the 
trouble-possible first check scores for 
the two studies also showed a significant
Study × Method interaction (F
= 6.09, P < .02 for df = 1, 45).
Here the variances were not heterogeneous
(χ² = 5.30, P > .10). The
method of instruction seemed to 
produce a dramatic change in the 
ability to deduce a set of trouble- 
possible units as well as in the ability 
to locate a faulty unit in a minimal 
number of checks. 


Discussion 


Although the half-split method has 
been demonstrated to be logically
superior (3), the results of the two experiments
herein reported indicate that its
empirical superiority is a function of 
the nature of the task, and the degree 
and type of training given the trouble 
shooter. The S’s checks more closely 
approximate those which would be 
expected logically when task complexity 
is low or instruction in deducing trouble- 
possible units is high. 

There seem to be two components in 
the empirical process of trouble shooting. 
The first component involves finding 
the trouble-possible units. The second 
component involves half-splitting the 
trouble-possible units at each stage by 
some checking operation, or at least 
eliminating part of the units from con- 
sideration. In the present experiments 
the second step was relatively simple. 
Those Ss who were able to locate the 
trouble-possible units, as measured by 
the occurrence of the first check in the 
trouble area, were able to perform effi- 
ciently in finding the troubles. 
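
The second component can be sketched as a binary search. The sketch assumes a single fault in a serial chain of units and assumes that the set of trouble-possible units has already been deduced (the first component). The function name, the `signal_ok_after` check, and the 16-unit chain are hypothetical and are not taken from the apparatus used in the experiments.

```python
def half_split_locate(candidates, signal_ok_after):
    """Locate the single faulty unit among `candidates`, ordered along the signal path.

    `signal_ok_after(unit)` is a hypothetical check that returns True when the
    signal is still normal at the output of `unit`.
    Returns (faulty_unit, number_of_checks_made).
    """
    lo, hi = 0, len(candidates) - 1   # the fault lies somewhere in candidates[lo..hi]
    checks = 0
    while lo < hi:
        mid = (lo + hi) // 2          # check at the midpoint of the remaining units
        checks += 1
        if signal_ok_after(candidates[mid]):
            lo = mid + 1              # signal good here, so the fault is downstream
        else:
            hi = mid                  # signal bad here, so the fault is at mid or upstream
    return candidates[lo], checks


# Illustrative chain of 16 trouble-possible units with the fault placed at unit 11.
units = list(range(1, 17))
faulty_unit = 11
found, n_checks = half_split_locate(units, lambda unit: unit < faulty_unit)
print(found, n_checks)
```

With 16 candidates each check halves the remaining set, so four checks suffice wherever the fault lies. If the candidate set is deduced wrongly, no amount of halving will find the fault, which is consistent with the emphasis placed on the first component.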

Wherever a significant difference in 
number of checks was found between 
groups, there was an accompanying 
difference between the number of initial 
checks in the trouble-possible area to 
help explain it. In Exp. I, failures to 
find expected differences were correlated
with difficulty in isolating trouble-
possible units. Also, half-split Ss who
avoided making any incorrect replacements
were more efficient than comparable
control Ss. The importance of
the deductive component was further 
substantiated by the marked superiority 
of DHS Ss in Exp. II as compared to HS
Ss. This difference seems attributable 
to their greater practice in the deductive 
component. 

It appears then that the failure to find 
a general superiority for the half-split 
method is the result of S’s inability to 
perform efficiently in the more funda- 
mental deductive task of isolating sets 
of trouble units. This result suggests 
that it will be more profitable in future 
research on training methods to empha- 
size the more basic deductive phase 
rather than the relatively mechanical 
half-splitting aspect of the process. 







SUMMARY 


This is a report of two experiments on the use 
of the half-split method of locating trouble 
sources in malfunctioning equipment. The 
experiments were carried out using apparatus 
and diagrams designed especially for this 
purpose. Experiment I investigated the effects 
of differing complexity of system and differing 
instructions on efficiency in locating trouble 
sources. Experiment II was a further investigation
of the effects of instruction and practice
for the high complexity condition. 

Both sets of results indicated that where the 
system relationships were easily mastered by a 
given S the half-split method was an aid to
efficiency. However, S needed either relatively 
high ability or instructional aid to overcome the 
load put on his capacities by the more complex 
systems. He had to be able to deduce from the 
symptom complex which units might be faulty 
in order to use the half-split method efficiently. 
It is probable that this ability also indicated that 
these Ss were better able to understand the
meanings of the checks made. It was concluded
that deductive ability is a prerequisite for appli- 
cation of the half-split technique, and indeed 
may play a preponderant role at each stage of the 
trouble-shooting task. 


REFERENCES 


1. Hoyt, C. J. Test reliability estimated by 
analysis of variance. Psychometrika, 
1941, 6, 153-160. 

2. Marx, M. H., Goldbeck, R. A., &
Bernstein, B. B. An apparatus for
investigating the methods humans use in 
solving complex problems. Amer. J. 
Psychol., 1956, 69, 462-465. 

3. Miller, R. B., Folley, J. P., Jr., & Smith,
P. R. Systematic trouble shooting and
the half-split technique. Lackland Air 
Force Base, Texas: Human Resources 
Research Center, July, 1953. (Tech. 
Rep. 53-21). 


(Received May 4, 1956) 





Journal of Experimental Psychology 
Vol. 53, No. 5, 1957 


THE RELATION OF EYE MOVEMENTS DURING SLEEP 
TO DREAM ACTIVITY: AN OBJECTIVE METHOD 
FOR THE STUDY OF DREAMING 


WILLIAM DEMENT¹ AND NATHANIEL KLEITMAN²


Department of Physiology, University of Chicago 


The study of dream activity and 
its relation to physiological variables 
during sleep necessitates a reliable 
method of determining with precision 
when dreaming occurs. This knowl- 
edge, in the final analysis, always 
depends upon the subjective report 
of the dreamer, but becomes relatively 
objective if such reports can be sig- 
nificantly related to some physio- 
logical phenomena which in turn can 
be measured by physical techniques. 

Such a relationship was reported 
by Aserinsky and Kleitman (1) who 
observed periods of rapid, conjugate 
eye movements during sleep and
found a high incidence of dream recall
in Ss awakened during these periods
and a low incidence when awakened
at other times. The occurrence of 
these characteristic eye movements 
and their relation to dreaming were 
confirmed in both normal Ss and 
schizophrenics (4), and they were 
shown to appear at regular intervals 
in relation to a cyclic change in the 
depth of sleep during the night as 
measured by the EEG (5). 

This paper represents the results 
of a rigorous testing of the relation 
between eye movements and dream- 
ing. Three approaches were used: 
(a) Dream recall during rapid eye 
movement or quiescent periods was 
elicited without direct contact between
E and S, thus eliminating the
possibility of unintentional cuing by
E. (b) The subjective estimate of
the duration of dreams was compared
with the length of eye movement
periods before awakening, reasoning
that there should be a positive correlation
if dreaming and eye movements
were concurrent. (c) The pattern
of the eye movements was related
to the dream content to test whether
they represented a specific expression
of the visual experience of dreaming
or merely a random motor discharge
of a more active central nervous
system.

¹ Postdoctoral Public Health Service Research
Fellow of the National Institute of Neurological
Diseases and Blindness.

² Aided by a grant from the Wallace C. and
Clara A. Abbott Memorial Fund of the University
of Chicago.


METHOD


The Ss for the experiments were seven adult 
males and two adult females. Five were studied 
intensively while the data gathered from the 
other four were minimal with the main intent of 
confirming the results on the first five. 

In a typical experiment, S reported to the 
laboratory a little before his usual bedtime. He 
was instructed to eat normally but to abstain 
from alcoholic or caffeine-containing beverages 
on the day of the experiment. Two or more 
electrodes were attached near the eyes for 
registering changes in the corneoretinal potential 
fields as the eyes moved. Two or three elec- 
trodes were affixed to the scalp for recording 
brain waves as a criterion of depth of sleep. 
The S then went to bed in a quiet, dark room. 
All electrode lead wires were further attached to 
the top of the head and from there to the lead 
box at the head of the bed in a single cord to 
minimize the possibility of entanglement and
allow S a free range of movement. The po- 
tentials were amplified by a Model III Grass 
Electroencephalograph in an adjoining room. 
The electroencephalograph was run continuously 
throughout the sleep period at a paper speed of 
3 or 6 mm. per sec. which allowed easy recog- 
nition of eye-movement potentials. A faster 
speed (3 cm./sec.) was used for detailed ex- 
amination of the brain waves although the 
slower speed permitted at least an approximate
estimation of the gross pattern. The criteria of
eye-movement potentials and their differentiation
from brain wave artifacts have been discussed
at length elsewhere (1, 4).


TABLE 1


Summary of Experiments

[Table body not recoverable from the scanned source. Surviving column header: Average Nightly Awakenings. Surviving row label: Totals.]

At various times during the night Ss were 
awakened to test their dream recall. The return
to sleep after such an awakening invariably took
less than 5 min. Table 1 is a summary of the
experiments showing the number of nights each 
S slept and the number of awakenings. In all, 
21% of the awakenings fell in the first 2 hr. of 
sleep, 29% in the second two, 28% in the third 
two, and 22% in the fourth two. 


RESULTS 


The occurrence of rapid eye movements.³—Discrete
periods during
which their eyes exhibited rapid
movements were observed in all nine
Ss every night they slept. These
periods were characterized by a low-voltage,
relatively fast pattern in the
EEG. The interspersed periods in
which rapid eye movements were
absent showed EEG patterns indicative
of deeper sleep, either a
predominance of high-voltage, slow
activity, or frequent, well-defined
sleep spindles with a low-voltage
background. No REM’s were ever
observed during the initial onset of
sleep although the EEG always passed
through a stage similar to that accompanying
the rapid eye movement
periods occurring later in the night.
These findings concerning associated
EEG patterns were identical with
previous observations on uninterrupted
sleep (5).

³ In most of the remaining text the following
abbreviations will be used: REM’s (rapid eye
movements) and NREM’s (no rapid eye movements).

An accurate appraisal of the mean 
duration of the REM periods was 
impossible since most were terminated 
artificially by an awakening. How- 
ever, those that were not so termi- 
nated varied between 3 and 50 min. 
in duration with a mean of about 20 
min., and they tended to be longer 
the later in the night they occurred. 
The eyes were not constantly in 
motion during such periods; rather, 
the activity occurred in bursts of one 
or two, up to fifty or a hundred 
movements. A single movement was 
generally accomplished in .1-.2 sec.
and was followed by a fixational pause 
of varying duration. The amount, 
pattern, and size of the movements 
varied irregularly from period to 
period. 

The REM periods occurred at 
fairly regular intervals throughout the 
night. The frequency of occurrence 
seemed to be relatively constant and 
characteristic for the individual. DM 
and WD averaged one eye-movement 
period every 70 min. and every 75 
min. respectively. KC averaged one 
eye-movement period every 104 min. 
The other Ss fell between these two 
extremes. The average for the whole
group was one REM period every 92 
min. 

Despite the considerable disturb- 
ance of being awakened a number of 
times, the frequency and regularity 
with which REM periods occurred 
was almost exactly comparable to 
that seen previously in a study of 
uninterrupted sleep (5). If the awak- 
ening occurred during a NREM 
period, the return to sleep was never
associated with REM’s, nor was the
time of onset of the next REM period 
markedly changed from that which 
would have been expected in the 
absence of an awakening. An awak- 
ening during an REM period gen- 
erally terminated the REM’s until 
the next period, and the sequence of 
EEG changes, excluding the brief 
period of wakefulness, was the same 
as that following an REM period 
that ended spontaneously. Excep- 
tions occurred when S was awakened 
during an REM period in the final 
hours of sleep when the period was 
likely to be quite long if uninter- 
rupted. On these occasions, the 
REM’s sometimes started up again 
when S fell asleep. It seemed as 
though a period of heightened CNS 
activity had not run its normal course 
and, although S was able to fall 
asleep, he continued to dream. 

Eye movement periods and dream 
recall.—For all awakenings to elicit 
dream recall, the arousing stimulus 
was the ringing of an ordinary door- 
bell placed near the bed and suffi- 
ciently loud to ensure immediate 
awakening in all levels of sleep. The 
Ss then spoke into a recording device 
near the bed. They were instructed 
to first state whether or not they had 
been dreaming and then, if they could, 
to relate the content of the dream. 
When S had finished speaking, E,
who could hear their voices, occasionally
entered the room to further
question them on some particular
point of the dream. There was no
communication between S and E in
any instance, it must be emphasized,
until S had definitely committed himself.
The Ss were considered to have
been dreaming only if they could 
relate a coherent, fairly detailed 
description of dream content. Assertions
that they had dreamed without
recall of content, or vague, fragmentary
impressions of content, were
considered negative.


TABLE 2


Instances of Dream Recall After Awakenings During Periods of Rapid Eye Movements or Periods of No Rapid Eye Movements

[Table body not recoverable from the scanned source. Surviving column headers: Rapid Eye Movements (Dream Recall, No Recall); No Rapid Eye Movements (Dream Recall, No Recall). A surviving Totals entry reads 152.]

The awakenings were done either 
during REM periods or at varying 
increments of time after the cessation 
of eye movements during the interspersed
periods of NREM’s. The Ss,
of course, were never informed when 
awakened whether or not their eyes 
had been moving. 

Table 2 shows the results of the 
attempts to recall dreams after the 
various awakenings. The REM or 
NREM awakenings for PM and KC 
were chosen according to a table of 
random numbers to eliminate any 
possibility of an unintentional pat- 
tern. For DN, a pattern was fol- 
lowed: first three REM awakenings, 
then three NREM awakenings, and 
so on. WD was told he would be 
awakened only when the recording 
indicated that he was dreaming, but 
REM and NREM awakenings were 
then interspersed randomly. The
type of awakenings for IR was
chosen according to the whim of E.

The Ss uniformly showed a high 
incidence of dream recall following
REM awakenings and a very low
incidence of recall following awak- 
enings during periods of NREM’s 
regardless of how the awakenings 
were chosen. In particular, DN was 
not more accurate than the others 
although there was a pattern he 
might have learned, and WD was not 
less accurate although he was de- 
liberately misled to expect to have 
been dreaming every time he was 
awakened. Over a narrow range, 
some Ss appeared better able to recall 
dreams than others. 

Table 3 compares the results of the 
first half of the series of REM awak- 
enings with the last half. Practice 
was certainly not a significant factor 
as only one S showed any degree of 
improvement of recall on later nights 
as compared with the early ones. 

The incidence of dream recall
dropped precipitously almost imme- 
diately upon cessation of REM’s. In 
17 NREM awakenings that were done 
within 8 min. after the end of a REM 
period, 5 dreams were recalled. Al- 
though small, this was a much higher 
incidence of dream recall than oc- 
curred when the NREM awakenings 
followed the end of REM periods 
by more than 8 min. In the latter 
category only 6 dreams were recalled 
in 132 awakenings. 
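
The two recall rates can be computed from the counts just given. The sketch below also applies a one-sided Fisher exact test, which is not part of the original analysis and is offered only as an illustration; it treats awakenings as independent, an assumption that is doubtful because several awakenings came from the same S. All names in the code are hypothetical.

```python
from math import comb

# NREM awakenings, counts as reported in the text.
early = {"recalled": 5, "awakenings": 17}    # within 8 min after the end of a REM period
late = {"recalled": 6, "awakenings": 132}    # more than 8 min after the end of a REM period


def recall_rate(group):
    """Proportion of awakenings that yielded a recalled dream."""
    return group["recalled"] / group["awakenings"]


def fisher_one_sided(a, n1, b, n2):
    """Probability of at least `a` recalls in the first group under the
    hypergeometric null (all row and column totals held fixed)."""
    total_recalled = a + b
    upper = min(n1, total_recalled)
    tail = sum(
        comb(n1, k) * comb(n2, total_recalled - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n1 + n2, total_recalled)


p_value = fisher_one_sided(
    early["recalled"], early["awakenings"], late["recalled"], late["awakenings"]
)

print(f"Recall within 8 min:  {recall_rate(early):.3f}")
print(f"Recall after 8 min:   {recall_rate(late):.3f}")
print(f"One-sided Fisher exact p (illustrative): {p_value:.4f}")
```

Recall within 8 min is roughly six times as frequent as recall after longer intervals, which is the contrast the paragraph draws.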


TABLE 3


Comparison of First Half of Series of Rapid Eye Movement Awakenings With Second Half

[Table body not recoverable from the scanned source. Surviving column headers: First Half (Dream Recall, No Recall); Second Half (Dream Recall, No Recall).]



In general, Ss were best able to 
make an emphatic statement that 
they had not been dreaming when the 
NREM awakenings were done during 
an intermediate stage of sleep as 
indicated by a brain-wave pattern of 
spindling with a low-voltage back- 
ground. When aroused during a 
deep stage of sleep characterized by 
high-voltage, slow waves in the EEG, 
Ss often awoke somewhat bewildered. 
In this state they frequently felt that 
they must have been dreaming al- 
though they could not remember the 
dream or, on the other hand, that 
they had not been asleep at all. They
sometimes had a great variety of 
feelings to describe—such as pleasant- 
ness, anxiety, detachment, etc., but 
these could not be related to any 
specific dream content. 

Most of the instances of inability 
to recall dreaming after awakenings 
during REM periods occurred in the 
early part of the night. Of 39 nega- 
tive reports in the entire study, 19 
occurred after awakenings during 
REM periods falling in the first 2 hr. 
of sleep, 11 after REM awakenings 
during the second 2 hr., 5 in the third 
2 hr., and 4 in the last 2 hr. There 
was no such variation relating to 
awakenings during the interspersed 
periods of ocular quiescence, the 
incidence of dream recall being uni- 
formly low, regardless of whether the 
early or late part of the night was 
being considered. 
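
The decline across the night can be tabulated from the four counts above. This is a minimal sketch with hypothetical labels. It gives each block's share of the 39 negative reports, not a failure rate per block, because the number of REM awakenings in each 2-hr block is not stated in the text.

```python
# Negative (no-recall) reports after REM awakenings, by 2-hr block of sleep,
# as reported in the text.
negative_reports = {
    "first 2 hr": 19,
    "second 2 hr": 11,
    "third 2 hr": 5,
    "last 2 hr": 4,
}

total_negative = sum(negative_reports.values())

# Share of all negative reports falling in each block.
shares = {block: count / total_negative for block, count in negative_reports.items()}

# Share falling in the first half of the night (first two blocks).
first_half_share = shares["first 2 hr"] + shares["second 2 hr"]

for block, share in shares.items():
    print(f"{block}: {share:.1%}")
print(f"first 4 hr combined: {first_half_share:.1%}")
```

The counts sum to the 39 negative reports stated in the text, and about three quarters of them fall in the first 4 hr of sleep.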

Length of rapid eye movement periods 
and subjective dream-duration esti- 
mates.—If the length of the REM 
periods were proportional to the 
subjectively estimated duration of the 
dreams, it would further help to 
establish the relatedness of the two 
and would give some information 
about the rate at which dreaming 
progresses. 


TABLE 4


Results of Dream-Duration Estimates After 5 or 15 Min. of Rapid Eye Movements

[Table body not recoverable from the scanned source. Surviving column headers: 5 Minutes (Right, Wrong); 15 Minutes (Right, Wrong).]


At first, Ss were awakened at
various increments of time after the
REM’s had begun and were requested 
to estimate to the nearest minute the 
amount of time they had been dream- 
ing. This proved to be too difficult, 
although the estimates were always 
of the same order of magnitude as the 
lengths of the REM periods, and were 
occasionally exactly right. 

A series was then done in which Ss 
were awakened either 5 or 15 min. 
after the onset of REM’s and were 
required on the basis of their recall 
of the dream to decide which was the 
correct duration. The 5- or 15-min. 
periods were chosen on the basis of a 
random series. Table 4 shows the 
results of these awakenings. All Ss 
were able to choose the correct dream 
duration with high accuracy except 
DN. This S, however, made most 
of his incorrect choices by estimating 
15 min. to be 5 min. This is con- 
sistent with the interpretation that 
the dream was longer, but he was only 
able to recall the latter fraction and 
thus thought it was shorter than it 
actually was. 

In addition to depending on the 
amount of actual dreaming, the 
lengths of the dream narratives were 
undoubtedly influenced by many 
other factors as, for example, the 
loquacity or taciturnity of S. However,
the lengths of the dream narratives
still showed a significant relationship
to the duration of REM periods
before awakening. Table 5 shows 
the correlations between minutes of 
REM’s and lengths of dream nar- 
ratives for each S. The number of 
words in the narrative was the meas- 
urement of length. Of the 152 
dreams recalled, 26 were not included 
because poor recording did not allow 
complete transcription. Dream nar- 
ratives recalled after 30 or as much as 
50 min. of REM’s were not a great 
deal longer than those after 15 min. 
although Ss had the impression that 
they had been dreaming for an unusually
long time. This was perhaps
due to inability to remember all the 
details of very long dreams. 

Specific eye-movement patterns and 
visual imagery of the dream.—The 
quality and quantity of the REM’s 
themselves showed endless variation. 
There was much or little movement, 
big or small movements, and so on. 
As has been stated, the movements 
occurred in bursts of activity sepa- 
rated by periods of relative inactivity. 
However, the brain-wave stage during 
the whole period remained the same 
whether there was much or little 
movement at any given moment of 
the period. 

TABLE 5


Correlation Between Duration of REM Periods in Minutes and Number of Words in Dream Narratives

[Table body not recoverable from the scanned source. Surviving column header: Number of Dreams.]


It was hypothesized that the movements
represented the visual imagery
of the dream, that is, that they
corresponded to where and at what
the dreamer was looking. An at- 
tempt to account for every movement 
by having S state chronologically in 
what directions he had gazed in the 
dream proved futile. The Ss could 
not recall the dream with such a high 
order of detail and precision. 

In a slightly different approach, Ss 
were awakened as soon as one of four 
predominant patterns of movement 
had persisted for at least 1 min. and 
were asked to describe in detail the 
dream content just before awakening. 
The four patterns were: (a) mainly 
vertical eye movements, (b) mainly 
horizontal movements, (c) both ver- 
tical and horizontal movements, and 
(d) very little or no movement. The 
prevalence of the horizontal or vertical 
components was determined by placing
leads both vertically and horizontally
around the eyes.

A total of 35 awakenings was 
accumulated from the nine Ss. 
Periods of either pure vertical or
horizontal movements were extremely
rare. Three such periods of vertical
movements were seen. After each
of these the dream content involved
a predominance of action in the
vertical plane. One S dreamed of
standing at the bottom of a tall cliff
operating some sort of hoist and
looking up at climbers at various
levels and down at the hoist machinery.
Another S dreamed of
climbing up a series of ladders looking
up and down as he climbed. In the
third instance the dreamer was throwing
basketballs at a net, first shooting
and looking up at the net, and then
looking down to pick another ball off
the floor. Only one instance of pure
horizontal movement was seen. In
the associated dream S was watching
two people throwing tomatoes at each
other. On 10 occasions Ss were
awakened after 1 min. of little or no
eye movement. In these, the dreams
all had the common property that
the dreamer was watching something
at a distance or just staring fixedly at
some object. In two of these awakenings
in different Ss the patterns
were the same, as follows: about a 
minute of ocular inactivity followed 
by several large movements to the 
left just a second or two before the 
awakening. Both instances, inter- 
estingly enough, were virtually identi- 
cal as regards dream content. In 
one case S was driving a car and 
staring at the road ahead. He ap- 
proached an intersection and was 
startled by the sudden appearance of 
a car speeding at him from the left 
as the bell rang. In the other, the 
dreamer was also driving a car and 
staring at the road ahead. Just 
before the awakening he saw a man 
standing on the left side of the road 
and hailed him as he drove past. 

In the 21 awakenings after a mix- 
ture of movements Ss were always 
looking at things close to them, 
objects or people. Typical reports 
were of talking to a group of people, 
looking for something, fighting with 
someone, and so forth. There was 
no recall of distant or vertical 
activity. 

In order to confirm the meaning- 
fulness of these relationships, 20 
naive Ss as well as 5 of the experi- 
mental Ss were asked to observe 
distant and close-up activity while 
awake. Horizontal and vertical elec- 
trodes were attached. The eye-move- 
ment potentials in all cases were 
comparable in both amplitude and 
pattern to those occurring during 
dreaming. Furthermore, there was 
virtually no movement, as indicated 
by the eye potentials, when viewing 
distant activity, and much movement 
while viewing close-up activity. Vertical
eye-movement potentials were
always at a minimum except for the
upward movements accompanying 
blinking, and in a few cases when E 
tossed a ball in the air for them to 
watch. 


Discussion 


The results of these experiments indi- 
cate that dreaming accompanied by 
REM’s and a low-voltage electroen- 
cephalogram occurred periodically in 
discrete episodes during the course of a 
night’s sleep. It cannot be stated with 
complete certainty that some sort of 
dream activity did not occur at other 
times. However, the lack of recall and 
also the fact that the brain waves were 
at the lightest level of sleep only during 
REM periods and at deeper levels at all 
other times, makes this unlikely. The 
few instances of dream recall during 
NREM periods are best accounted for 
by assuming that the memory of the 
preceding dream persisted for an un- 
usually long time. This is borne out 
by the fact that most of these instances 
occurred very close, within 8 min., after
the end of REM periods.


Other workers have attempted to 
relate dreaming to physiological phe- 
nomena during sleep. Wada (12) felt 
that dreaming and gastric contractions 
occurred simultaneously. However, this 
conclusion was based on only seven 
awakenings in two Ss. One was unable 
to recall dream content although he felt 
he had been dreaming and the other 
remembered dream content in 3 of 4 
awakenings. Scantlebury, Frick, and 
Patterson (11) also studied gastric 
activity and dreaming. They felt, on 
the basis of three instances of dream 
recall out of seven awakenings, that the 
two were probably related, but judi- 
ciously stated that “the exact time 
during which a dream occurs is elusive 
of record.” The occurrence of dreaming 
during a series of foot twitches occurring 
immediately after the onset of sleep was 
postulated by McGlade (9). However, 
he based this conclusion mainly on 
dreams recalled on the morning after the 
experiments, which is highly unreliable,
and only 3 out of the 25 Ss studied exhibited
foot twitches.

Incidental observations have been 
made on the occurrence of dreaming 
by investigators studying brain waves 
during sleep (2, 3, 6, 7, 8). All stages 
of brain waves were related to dreaming 
in these five papers, but no mention was 
made of whether or not actual dream 
content was recalled, and the number of 
reports by sleepers was generally very 
small. 

In other studies of dreaming, excel- 
lently reviewed by Ramsey (10), at- 
tempts were made to localize dream 
activity by simply awakening Ss at 
various times during the night. In 
general it was found that dreams might 
be recalled at any time during the night, 
but that most were recalled in the later 
hours of sleep. This would correspond 
to the statistical incidence of REM’s as 
previously reported (1, 4), and is also 
consistent with the finding in this study 
that, even when the awakenings occurred 
during REM periods, recall was still 
more difficult earlier in the night. 

It was stated herein that all Ss showed 
periods of REM’s every night they slept. 
This was also the case in another briefly 
reported series of experiments involving 
16 Ss who were observed a total of 43 
nights (5). It is felt on the basis of 
these and other studies which are un- 
reported that periods of REM’s and 
dreaming and the regularity with which 
they occur are an intrinsic part of normal 
sleep. In view of this, the failure to 
observe REM’s in occasional Ss reported 
in earlier work (1, 4) deserves some 
consideration. One explanation is that 
the recording was done by sampling 
rather than continuously. If the REM 
periods were shorter than usual, they 
may have occurred in the intervals 
between the samples, thus escaping 
observation. Another explanation is 
that a lower amplification of the REM 
potentials was employed which, although 
usually adequate, did not clearly record 
very small movements. A third possi- 
bility is that the dreams of these Ss 
happened to be the sort, such as watching
distant activity, in which eye movement
was at a minimum. Since the asso- 
ciation of the characteristic low-voltage, 
non-spindling EEG was not realized at 
the time and thus could not aid in 
identifying this sort of period, they very 
likely would have been overlooked. 

There was nothing in the experiments 
reported in this paper to indicate that 
the dreams occurred instantaneously, or 
with great rapidity, as some have sup- 
posed. Rather, they seemed to progress 
at a rate comparable to a real experience 
of the same sort. An increment in the 
length of REM periods was almost 
invariably associated with a proportional 
increase in the length of the dream. 
This could not have occurred if dreaming 
were instantaneous, since any length of 
REM periods would then easily accom- 
modate a virtually infinite amount of 
dream activity. 

It seems reasonable to conclude that 
an objective measurement of dreaming 
may be accomplished by recording
REM’s during sleep. This stands in 
marked contrast to the forgetting, distortion,
and other factors that are involved
in the reliance on the subjective
recall of dreams. It thus becomes
possible to objectively study the effect 
on dreaming of environmental changes, 
psychological stress, drug administra- 
tion, and a variety of other factors and 
influences. 


SUMMARY 


Regularly occurring periods of REM’s were 
observed during every night of experimental 
sleep in nine adult Ss. A high incidence of 
dream recall was obtained from Ss when awak- 
ened during REM periods and a very low 
incidence when they were awakened at other 
times. A series of awakenings was done either 
5 or 15 min. after the REM’s (dreaming) had 
begun and Ss judged the correct dream duration 
with high accuracy. The pattern of the REM’s 
was related to the visual imagery of the dream, 
and the eye movements recorded in analogous 
situations while awake corresponded closely in
amplitude and pattern to those observed during
dreaming. 


REFERENCES 


1. Aserinsky, E., & Kleitman, N. Two types
of ocular motility occurring in sleep. J.
appl. Physiol., 1955, 8, 1-10.

2. Blake, H., Gerard, R., & Kleitman, N.
Factors influencing brain potentials during
sleep. J. Neurophysiol., 1939, 2,
48-60.

3. Davis, H., Davis, P., Loomis, A. L.,
Harvey, E. N., & Hobart, G. A.
Human brain potentials during the onset
of sleep. J. Neurophysiol., 1938, 1,
24-38.

4. Dement, W. Dream recall and eye movements
during sleep in schizophrenics and
normals. J. nerv. ment. Dis., 1955, 122,
263-269.

5. Dement, W., & Kleitman, N. Incidence
of eye motility during sleep in relation to
varying EEG pattern. Fed. Proc., 1955,
14, 216.

6. Henry, C. E. Electroencephalographic
individual differences and their constancy.
I. During sleep. J. exp. Psychol., 1941,
29, 117-132.

7. Knott, J. R., Henry, C. E., & Hadley,
J. M. Brain potentials during sleep; a
comparative study of the dominant and
non-dominant alpha groups. J. exp.
Psychol., 1939, 24, 157-168.

8. Loomis, A. L., Harvey, E. N., & Hobart,
G. A. Cerebral states during sleep as
studied by human brain potentials. J.
exp. Psychol., 1937, 21, 127-144.

9. McGlade, H. B. The relationship between
gastric motility, muscular twitching
during sleep and dreaming. Amer. J.
digest. Dis., 1942, 9, 137-140.

10. Ramsey, G. Studies of dreaming.
Psychol. Bull., 1953, 50, 432-455.

11. Scantlebury, R. E., Frick, H. L., &
Patterson, T. L. The effect of normal
and hypnotically induced dreams on the
gastric hunger movements of man. J.
appl. Psychol., 1942, 26, 682-691.

12. Wada, T. An experimental study of hunger
and its relation to activity. Arch.
Psychol., N. Y., 1922, 8, No. 57.

(Received May 9, 1956) 





Journal of Experimental Psychology 
Vol. 53, No. 5, 1957 


EFFECT OF IRRELEVANT THIRST MOTIVATION ON 
A RESPONSE LEARNED WITH FOOD REWARD 


G. ROBERT GRICE AND JOHN D. DAVIS 


University of Illinois 


The generalized response-activating 
character of drive (D) which has been 
hypothesized by Hull (6) and Spence 
(13) has received a fairly substantial 
amount of empirical support. Amsel 
(1) found that rats will perform a 
running habit motivated by fear more 
vigorously when they are also hungry. 
Amsel and Maltzman (2) found that 
the presence of emotionality will, 
under certain circumstances, increase 
the amount of drinking behavior. 
The Iowa studies of anxiety in human 
Ss have been based on this assumption 
and, in general, have yielded results 
in accord with prediction (e.g., 5, 12, 
16). In the case of purely appeti- 
tional drives, Webb (18) trained rats 
on a panel-pushing response under 
food deprivation for food reward and 
conducted extinction trials under vari- 
ous degrees of water deprivation with 
the hunger drive satiated. He found 
that the number of extinction re- 
sponses was an increasing function of 
the number of hours of water depri- 
vation. This result was confirmed by 
Brandauer (3) in a similarly designed 
experiment employing a bar-pressing 
response. 

Verplanck and Hayes (17) have 
raised a serious question concerning 
the interpretation of the Webb experiment
in terms of nonspecific
activation by a generalized drive 
state. Their question concerns the 
possibility of the independent ma- 
nipulation of the hunger and thirst 
drives. They found that rats de- 
prived of water with food con- 
tinuously available and then allowed 
to drink would now eat an amount 


almost equivalent to the eating pro- 
duced by an equal period of food 
deprivation. They found, further, 
that water-deprived rats eat only 
about 60% as much food as rats not 
so deprived. Their findings were 
also similar for the reverse drive 
arrangement. The implication of 
these findings for studies like Webb’s 
is that the hunger drive may actually 
be present in considerable strength 
following water deprivation. Pos- 
sibly adequate hunger satiation is 
impossible because of the reduced 
eating by thirsty animals. Webb 
was also aware of this possible inter- 
pretation of his findings, but tenta- 
tively rejected it in favor of “Hull’s 
hypothesis of ‘a generalized drive 
state’ being ‘contributed to’ by the 
irrelevant drives (defined by depri- 
vation time) operative at the time” 
(18, p. 13). 

The present experiment is an at- 
tempt to investigate these alternative 
explanations of the role of an ir- 
relevant thirst drive. Four groups 
of rats were trained on a panel-push- 
ing response for food reward under 
hunger motivation. One control 
group was then given extinction trials 
under hunger motivation and a second 
while satiated with both food and 
water. One experimental group was 
tested under thirst conditions. This 
much constitutes the basic design of 
the Webb study. A second experi- 
mental group was subjected to the 
same water deprivation and food 
satiation procedures as the thirsty 
group, but was then allowed to drink 
to satiation prior to the test. The
most obvious implication of the
generalized drive hypothesis is that, 
with the irrelevant drive satiated, 
this group should be lower in response 
strength than the thirsty group. The 
implication of the Verplanck and 
Hayes finding, however, is that this 
group should respond more than the 
thirsty group, or at least be equal to 
it. 


METHOD
Subjects 


The Ss were 80 experimentally naive, albino 
rats of Sprague-Dawley strain. There were 10
males and 10 females in each of the four experimental
groups. Their ages ranged from 70 to
90 days at the beginning of the experiment. 


Apparatus 


The apparatus consisted of a gray plywood 
box 12 in. long, 10 in. wide, and 6 in. high, inside 
dimensions. The top was of hinged lucite and 
there was a row of thirty-five yy-in. holes 44 in. 
above the floor extending around the walls to 
permit the passage of air. The apparatus was 
illuminated by a 15-w. bulb and reflector 24 in. 
above the floor at the rear of the box. At the 
middle of one end there was a lucite panel 14 in. 
wide by 6 in. high and hinged at the top. When 
closed the panel was flush with the end of the 
box. A vertical sliding door of }-in. masonite 
and painted the same as the box was placed 
immediately in front of the panel. With the 
door raised the panel was exposed to a height 
of 2} in. When the door, which was controlled 
manually by means of a string and counterweight 
arrangement, reached a height of 1 in., a holding 
relay was closed and a Standard Electric Timer 
was started. When the panel was pushed out- 
ward a distance of $ in. at its bottom, the circuit 
was broken by means of a tilting mercury switch, 
and the clock was stopped. Food pellets, con- 
sisting of Tail Curler Junior and weighing ap- 
proximately .05 gm., were placed in a slight 
depression in the floor immediately behind the 
panel. It was impossible for the rat to obtain 
the pellet without stopping the clock. 


Preliminary Training 


Throughout training Ss were maintained on a 
diet of 9 gm. of Purina Laboratory Chow per 
day, administered in individual feeding cages 
and subject to a 1-hr. maximum eating period.
Water was continuously available. They were
adapted to this schedule for a minimum of seven
and a maximum of nine days prior to preliminary 
training. On each of the last two days of the 
adaptation period, Ss received 10 pellets of the 
type used in training along with the daily ration. 
During preliminary training and training, Ss 
were fed immediately after the experimental 
session. Since, after adaptation, Ss actually 
ate in about 30 min., the hunger drive was produced
by approximately 23½ hr. of deprivation.
On the first pretraining day, 10 pellets were 
placed in the panel opening and Ss were allowed 
to eat them with the sliding door and the panel 
remaining open. On the second pretraining 
day, a single pellet was placed in the food hole 
with the panel open and the door closed. The 
door was raised and remained open until S had 
obtained and eaten the pellet. This was re- 
peated until S had consumed 10 pellets. 


Panel Training 


On the next day, a single pellet was placed 
in the apparatus with both the panel and door 
closed. The door was raised and remained open
until S had obtained the pellet by pushing the 
panel open with its nose. The door was then 
closed carefully to avoid frightening the animal. 
Latency was recorded to 1/100 sec. On the 
first training day only, E held the panel open 
with a string after the clock was stopped until 
S obtained the pellet on those few trials when it 
was necessary. Fifteen trials were presented
with an intertrial interval of 15 sec. timed from 
the lowering of the door at the end of one trial 
until it was raised at the beginning of the next. 
On each of the next two days, 15 additional 
trials were presented, making a total of 45 
acquisition trials.


Test Day 


On the following day, 30 trials were presented 
under conditions like that during acquisition 
except that no food was presented. The 
criterion of the occurrence of a response was the 
pushing of the panel far enough to stop the 
clock. Whenever S failed to respond in 30 sec., 
the door was lowered and the next trial was 
presented after the intertrial interval. Such a 
trial was counted as no response and as a 
maximum latency. 

The procedures used to produce the different 
drive conditions in the four experimental groups 
were as follows: 

Hungry Group.—This group was tested under 
the same food deprivation conditions which 
prevailed during training. 

Satiated Group.—For this group, water and a 
supply of Purina Laboratory Chow were continuously
available during the 24-hr. period
preceding the test. One and one-half hours 
before the test trials, additional Purina Chow 
and a rat which had been deprived of food for 
48 hr. were introduced into S’s living cage; 1 hr. 
before the test, S was placed in an individual 
feeding cage with water, Purina Chow, and a 
dish of the pellets used as incentive in the 
apparatus; and 30 min. before the test S was 
placed in an individual feeding cage with water 
available, but no food. This 30-min. period was 
introduced in order to make the group com- 
parable to the Thirsty-Drink Group with 
respect to the availability of food. 

Thirsty Group.—This group was treated in 
the same way as the Satiated Group, except that 
water was removed 22 hr. before the test, and 
no water was available during the special food 
satiation procedures. For 30 min. prior to the 
test, S was placed in an individual feeding cage 
with no food or water. Water deprivation of 
22 hr. was selected because it was Webb’s most 
effective irrelevant drive condition. 

Thirsty-Drink Group.—The treatment of this 
group was identical in every way to that of the 
Thirsty Group, except that a water bottle was 
available in the individual feeding cage for the 
30 min. prior to the test. It was noted that all 
Ss drank vigorously, but had ceased by the end 
of 30 min. 


RESULTS 


The results of the test day are 
presented in terms of three response 
measures—the number of responses 
during the 30 test trials, a speed 
measure, and a rate of responding 
measure. The speed measure was 
obtained by finding the median la- 
tency in seconds of the first 10 test 
trials for each S. This median was 
transformed to its reciprocal and
multiplied by 1000. For the rate
measure, the total amount of time 
during which the panel was available 
to each S was calculated by adding 
the latencies, including the 30 sec. 
failures, of the 30 trials. The number 
of responses was then divided by the 
total time in minutes, and the result 
is expressed as the number of re- 
sponses per minute. This measure 
is novel in this type of situation but 
has the advantage of utilizing all of 
the data collected during the test 
trials. The medians, means and SD’s 
of the three measures for the four 
groups are presented in Table 1. 
Since several of the distributions were 
markedly skewed for all measures, the 
medians are regarded as the better 
index of central tendency. 
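The three response measures described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' computing procedure: the function name `response_measures` and the constant `MAX_LATENCY` are hypothetical, and a trial is treated as a failure whenever its recorded latency equals the 30-sec. maximum.

```python
from statistics import median

MAX_LATENCY = 30.0  # seconds; a trial with no response is scored at this value


def response_measures(latencies, max_latency=MAX_LATENCY):
    """Return (number of responses, speed, responses per minute) for one S.

    `latencies` holds one latency in seconds per test trial, in trial order.
    A failure to respond is entered as `max_latency` (an assumption about
    how the records were kept).
    """
    # Number of responses: trials on which the clock was stopped in time.
    n_responses = sum(1 for t in latencies if t < max_latency)
    # Speed: reciprocal of the median latency of the first 10 trials, times 1000.
    speed = 1000.0 / median(latencies[:10])
    # Rate: responses divided by the total time the panel was available,
    # failures included, expressed in minutes.
    total_minutes = sum(latencies) / 60.0
    rate = n_responses / total_minutes
    return n_responses, speed, rate
```

For a hypothetical S with 20 latencies of 2 sec. followed by 10 failures, the total time is 340 sec., so the rate is 20 responses in 5.67 min., or about 3.5 responses per minute.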

In order to evaluate the over-all 
effects of the drive conditions, sex, 
and possible interactions, Wilson’s 
nonparametric test (20) which permits 
such an analysis was applied to the 
number-of-response data. The re- 
sults of this test are summarized in 
Table 2. As may be seen, the effect 
produced by the drive variable was 
highly significant, while the Sex and 
Sex X Drive interaction effects were 
not significant. 

The chief interest in the study, of 
course, is not in over-all significance, 
but in specific comparisons between 
conditions. These comparisons were 
made by means of the Mann and 


TABLE 1

RESULTS OF TEST DAY
(N = 20 in each group)

                    No. of Responses
Group            Median    Mean      SD

Hungry            26.50   24.90    4.18
Satiated          13.50   15.20    7.33
Thirsty           13.00   13.70    7.38
Thirsty-Drink     22.00   20.75    6.37





Responses/ Min. 





| 
Median | Mean 


5.75 


| 
1. 
| 


Mean 


319 | 4.68 
150 | | 129 | 231 
101 | | 1.18 LO 
227 | 3.62 | 3.87 








TABLE 2


NONPARAMETRIC ANALYSIS OF NUMBER
OF RESPONSES


Source 


Drive 
Sex 
DxS 


Total 


*p < OO, 


Whitney U test. For the number- 
of-responses measure, the Hungry 
Group differed from the Satiated and 
Thirsty Groups at the .001 level and
from the Thirsty-Drink Group at the
.03 level. The Thirsty-Drink Group
differed significantly from the Satiated
Group at the .02 level and from the
Thirsty Group at the .01 level. The
Satiated and Thirsty Groups did not 
differ significantly. The results of 
this test when applied to the other 
response measures were essentially 
the same, except that the difference 


between the Hungry and Thirsty- 
Drink Groups was not significant for 
the other measures. 
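The U statistic behind these pairwise comparisons can be sketched as follows. The code is a minimal illustration of the Mann and Whitney procedure using midranks for tied scores; the function name `mann_whitney_u` is hypothetical, and the conversion of U to a probability (by tables or a normal approximation) is not shown.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two independent samples, using midranks for ties."""
    # Pool the scores, remembering which sample each one came from.
    pooled = sorted(
        (value, label) for label, sample in ((0, x), (1, y)) for value in sample
    )
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        # Extend j over a run of tied values.
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        midrank = (i + j) / 2.0 + 1.0  # average of ranks i+1 through j+1
        for k in range(i, j + 1):
            ranks[k] = midrank
        i = j + 1
    rank_sum_x = sum(r for r, (_, label) in zip(ranks, pooled) if label == 0)
    n_x, n_y = len(x), len(y)
    # U_x counts the pairs in which an x score exceeds a y score (ties count half).
    u_x = rank_sum_x - n_x * (n_x + 1) / 2.0
    u_y = n_x * n_y - u_x
    return u_x, u_y
```

With two groups of 20 Ss, as here, the smaller of the two U values would be referred to the sampling distribution for n = 20 and 20.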

An examination of the group meas- 
ures in Table 1 and the significance 
values will immediately indicate that 
the results of the test day were quite 


clear. As was to be expected, the 
Hungry Group showed the greatest 
response strength. There was no 
significant difference between the 
Satiated and Thirsty Groups, al- 
though for all measures the Satiated 
Group showed a small superiority. 
Thus, the present study fails to con- 
firm the findings of Webb and 
Brandauer who found the irrelevant 
thirst drive groups superior to the 
satiated control. The satiation of the 
thirst drive in the Thirsty-Drink 
Group not only did not reduce the 
level of response, but actually increased
it as indicated by this group’s
significant superiority over the
Thirsty Group.


DISCUSSION


In considering the failure of the present 
experiment to confirm Webb’s finding, 
it should be clearly understood that this 
experiment was not, strictly speaking, a 
repetition of the Webb study since the 
apparatus and experimental procedures 
were not the same. However, the 
rationale of the design was the same and 
there is no apparent reason why it should 
not provide an equally good test of the 
generalized drive hypothesis. The hy- 
pothesis should lead to the same pre- 
diction in both cases. This negative 
finding may indicate a serious restriction 
upon its generality, and we must at least 
state that a response learned with food 
reward will not always be activated by 
an irrelevant thirst drive. In view of 
Brandauer’s confirmation of Webb’s 
result, as well as the nature of inference 
concerning the null hypothesis, the 
weight of the evidence remains on the 
positive side, but further clarification of 
the problem must await future research. 

More can be said in interpretation of 
the finding that allowing the water- 
deprived Ss to drink prior to the test 
resulted in an increase in response 
strength. The fact that reducing the 
thirst drive produced more vigorous 
responding implies that the presence of 
the drive was actually depressing the 
response. This is, of course, the direct 
opposite of the drive summation theory 
and of the interpretation of Webb. 
There are various ways in which this 
depressing effect might come about. 
One possible explanation is in terms of 
the concept of incentive motivation (K) 
as developed by Spence (14). It may 
be that the anticipatory goal response 
for food (r_g-food), the mechanism which
Spence assumes to provide the basis for 
K, is eliminated or seriously reduced by 
the presence of the thirst drive, but 
returns following satiation for water. 
This explanation is not entirely satisfactory,
however, since there is strong
indication in several maze-learning experiments
(7, 11, 15) that r_g-food does
occur in the presence of the thirst drive— 
particularly if it is not closely associated 
with drinking responses. 

Another possible explanation of the 
depression effect is suggested by the 
work of Kendler and his associates (8, 
9, 10). They have presented evidence 
that rats which have learned to go to 
food on one side of a T-maze show a 
tendency to avoid the food side when 
made thirsty. This is explained by 
hypothesizing that fractional anticipa- 
tory eating responses and the condition 
of water deprivation are incompatible, 
producing a conflict when they are 
simultaneously active. This state of 
conflict is assumed to be reduced by the 
avoidance of cues evoking the anticipa- 
tory eating response. This hypothesis 
seems to account for the present finding 
rather well. There does, however, ap- 
pear to be some difficulty in completely 
reconciling this theory with the fact that 
both cues associated with food (4) and 
the sight of food (19) will produce 
secondary reinforcement in thirsty rats. 

The question now to be considered is
that of the drive state responsible for the
increased level of response in the Thirsty-Drink
Group. The prediction made on
the basis of the Verplanck and Hayes 
analysis was clearly supported. It ap- 
pears that the hunger drive may have 
been present in considerable degree even 
though responding was depressed by the 
thirst drive in the Thirsty Group. As- 
suming that the Webb and the Brandauer 
results are reproducible at least under 
some conditions, it now seems likely that 
the positive relation between response 
strength and hours of water deprivation 
was not the result of activation by the 
irrelevant thirst drive. The present 
results strongly suggest the implication 
that the hunger drive was also increasing 
during water deprivation and that the 
motivation may have been entirely 
“relevant.” 

One possibility, not yet considered, is 
that water deprivation actually reduces 
the hunger drive, although presumably 
not the need state itself. However, 
considering the increasing function found
by both Webb and Brandauer together
with the present evidence that the effect 
of thirst is depressing, the best guess at 
present seems to be that hunger does 
increase during water deprivation but 
that performance is inhibited by some 
mechanism such as the ones suggested 
here. 

At the present time, it is probably wise 
to be cautious in evaluating the gener- 
alized drive theory in the light of these 
data. While this study does not support 
the theory and further implies that the 
Webb and Brandauer studies do not 
necessarily support it either, there is 
enough additional evidence to suggest 
that it has considerable validity. The 
theory may be valid as a general prin- 
ciple, but subject to restrictions arising 
from interaction effects between some 
specific drive states. 


SUMMARY 


Four groups of rats were trained under 
hunger motivation to push a panel for food 
reward. One group was given extinction trials
under hunger motivation, one while satiated, 
and one under thirst motivation. The fourth 
group was deprived of water like the thirst 
group, but was allowed to drink before the 
extinction trials. 

There was no significant difference in the 
extinction measures between the satiated and 
thirsty groups. Thus, the experiment failed to 
confirm previous studies showing an activating 
effect of irrelevant thirst motivation. 

The water-deprived group which was allowed 
to drink displayed more response strength during 
extinction than the thirsty group. Possible 
reasons for a depressing effect of the thirst drive 
are discussed. 

The results are interpreted as implying that 
the positive relation previously found between 
hours of water deprivation and the strength, 
during extinction, of a response learned with 
food reward may be due to increasing strength 
of the hunger drive. 


REFERENCES 

1. Amsel, A. The combination of a primary
appetitional need with primary and
secondary emotionally derived needs. J.
exp. Psychol., 1950, 40, 1-14.

2. Amsel, A., & Maltzman, I. The effect
upon generalized drive strength of emotionality
as inferred from the level of
consummatory response. J. exp. Psychol.,
1950, 40, 563-569.

3. Brandauer, C. M. A confirmation of
Webb’s data concerning the action of
irrelevant drives. J. exp. Psychol., 1953,
45, 150-152.

4. D’Amato, M. R. Transfer of secondary
reinforcement across the hunger and
thirst drives. J. exp. Psychol., 1955, 49,
352-356.

5. Farber, I. E., & Spence, K. W. Complex
learning and conditioning as a function
of anxiety. J. exp. Psychol., 1953, 45,
120-125.

6. Hull, C. L. Principles of behavior. New
York: D. Appleton-Century, 1943.

7. Kendler, H. H. The influence of simultaneous
hunger and thirst drives upon the
learning of two opposed spatial responses
of the white rat. J. exp. Psychol., 1946,
36, 212-220.

8. Kendler, H. H., Karasik, A. D., &
Schrier, A. M. Studies of the effect of
change of drive: III. Amounts of
switching produced by shifting drive from
thirst to hunger and from hunger to
thirst. J. exp. Psychol., 1954, 47, 179-182.

9. Kendler, H. H., & Levine, S. Studies of
the effect of change of drive: I. From
hunger to thirst in a T-maze. J. exp.
Psychol., 1951, 41, 429-436.

10. Kendler, H. H., Levine, S., Altchek, E.,
& Peters, H. Studies of the effect of
change of drive: II. From hunger to
different intensities of a thirst drive in a
T-maze. J. exp. Psychol., 1952, 44, 1-3.

11. McAllister, W. R. The spatial relation
of irrelevant and relevant goal objects as
a factor in simple selective learning. J.
comp. physiol. Psychol., 1952, 45, 531-537.

12. Montague, E. K. The role of anxiety in
serial rote learning. J. exp. Psychol.,
1953, 45, 91-96.

13. Spence, K. W. Current interpretations of
learning data and some recent developments
in stimulus-response theory.
Learning theory, personality theory, and
clinical research. The Kentucky symposium.
New York: Wiley, 1954.

14. Spence, K. W. Theoretical interpretations
of learning. In C. P. Stone (Ed.),
Comparative psychology. (3rd ed.) New
York: Prentice-Hall, 1951.

15. Strange, J. R. Latent learning under
conditions of high motivation. J. comp.
physiol. Psychol., 1950, 43, 194-197.

16. Taylor, J. A. The relationship of anxiety
to the conditioned eyelid response. J.
exp. Psychol., 1951, 41, 81-92.

17. Verplanck, W. S., & Hayes, J. R. Eating
and drinking as a function of maintenance
schedule. J. comp. physiol. Psychol.,
1953, 46, 327-333.

18. Webb, W. B. The motivational aspect of
an irrelevant drive in the behavior of the
white rat. J. exp. Psychol., 1949, 39,
1-14.

19. Wike, E. L., & Casey, A. The secondary
reinforcing value of food for thirsty
animals. J. comp. physiol. Psychol.,
1954, 47, 240-243.

20. Wilson, K. V. A distribution-free test of
analysis of variance hypotheses. Psychol.
Bull., 1956, 53, 96-101.


(Received May 10, 1956) 





Journal of Experimental Psychology
Vol. 53, No. 5, 1957


PREDICTION FROM AND INTERACTION AMONG
MULTIPLE CONCURRENT DISCRIMINATIVE
RESPONSES 1

CHARLES W. ERIKSEN 2

The Johns Hopkins University 


In the present study Os were pre- 
sented with a size-judgment task 
under the method of absolute judg- 
ment. Instead of using only one 
response or judgment to each stimulus 
presentation as is usually done, the 
present experiment varied the number 
and the nature of the judgmental 
responses. Here Os were required to 
make not only a verbal judgment to 
each stimulus but were also required 
to move a lever to a distance propor- 
tionate to the size of the presented 
stimulus and to make a “second
guess” verbal judgment.

The primary aim of this study was
to examine the role of supplemental
responses in a situation that contained
a fair amount of inherent uncertainty. 
Specifically, this experiment asked 
whether there is an independent 
relation between each of the several 
response systems and the stimulus or 
whether all of the responses will 
function essentially as a unit. For 
example, the first verbal judgment 
may serve to classify the perception 
for O, and the lever response and the 
second verbal judgment may then be 
mere reflections of the first judgment 
without any independent relation to 
the stimulus. Phrased statistically, 
the question becomes one of deter- 
mining whether there exist significant 
partial correlations between each of 
the response systems and the stimulus
when the remaining response systems
are held constant.

1 This investigation was supported by Research
Grant M-1037 from the National Institute
of Mental Health of the National Institutes of
Health, Public Health Service.

2 Now at the University of Illinois.

A second question that can be asked 
concerns the possibility of differential 
accuracy between the verbal judg- 
ment and the lever-movement re- 
sponse. Verbal responses in general, 
and in this experiment in particular, 
are of a discrete categorizing nature. 
On the other hand, the linear move- 
ment of a lever is, for practical pur- 
poses, continuous. In the present 
experiment an O who is confronted 
with Stimulus X may be uncertain as 
to whether this is Stimulus X or 
X +1. For his verbal judgment he 
must respond with either the response 
for X or the response for X + 1, but 
his lever movement is not so re- 
stricted. If the standard lever move- 
ment in response to X was in the 
range of 8-10 mm., one might rea- 
sonably expect the movement to be 
closer to 10 mm. when O was un- 
certain as to whether the stimulus 
was X or X + 1 and closer to 8 mm. 
when O’s uncertainty was between 
X and X — 1. The continuous char- 
acter of the lever-movement response 
may thus contribute to a higher
stimulus-response correlation for the 
lever response than would be obtained 
for the verbal responses, as well as 
contributing to a partial correlation 
between the stimuli and the lever 
response by reflecting the direction 
of O’s uncertainty. 

It should be noted that the present 
experiment also has an important 
relation to the subception effect (6). 
It has been shown elsewhere (1) that
the subception experiment of Lazarus
and McCleary (6) is basically a 
situation requiring multiple responses 
(a GSR and a verbal response) to the 
same stimulus, and the subception 
effect consists of a significant partial 
correlation between the stimulus and 
the GSR when the verbal responses 
are held constant. That such a 
partial correlation does exist has been 
further shown by Eriksen (2), and it 
was partly the purpose of the present 
study to increase the understanding 
of the circumstances under which 
such partial correlations will be 
obtained. 
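The kind of partial correlation at issue here can be illustrated with the standard first-order formula, in which the correlation between two variables is adjusted for the correlation of each with a third. The sketch below is not taken from the source; the function names `pearson_r` and `partial_r` are hypothetical, and in the subception case x would be the stimulus, y the GSR, and z the verbal response.

```python
from math import sqrt


def pearson_r(a, b):
    """Product-moment correlation between two equal-length lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    return cov / sqrt(var_a * var_b)


def partial_r(x, y, z):
    """First-order partial correlation of x and y with z held constant."""
    r_xy, r_xz, r_yz = pearson_r(x, y), pearson_r(x, z), pearson_r(y, z)
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
```

When z is uncorrelated with both x and y, the partial correlation reduces to the ordinary correlation between x and y. When z carries all of the relation between x and y, the partial correlation falls to zero.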


METHOD


Design.—The experimental task consisted of 
the absolute judgment of a series of 11 different 
sized, square stimuli. Three experimental 
groups were employed. In Group A, Os were 
required to use verbal responses consisting of 
the numbers “1” through “11” where the number 
“1” was assigned as the response to the smallest 
size, number “11” to the largest size, and the 
intermediate numbers were assigned to the 
intermediate stimuli in order of increasing size.
In addition to the verbal judgment, Os were
asked to move a vertically mounted lever
through a segment of arc that was proportionate 
to the size of the stimulus. After the verbal 
judgment and the lever response to each 
stimulus, each O was also asked to make a second 
verbal judgment of the stimulus, this time 
assuming that he had been told that his first 
verbal judgment had been wrong. 

The experimental treatment was the same 
for Os in Group B except for the manner in 
which they made the lever response. In Group 
A, Os had the lever in full view so that they were 
able to use visual cues while moving the lever. 
In Group B, Os had to reach their hands through 
a cloth curtain and position the lever without 
the use of visual control and feedback. This 
condition was employed as a partial control for 
Group A since it was anticipated that the lack 
of visual monitoring would result in greater 
error in the lever response which in turn should 
reduce the possibility of partial correlations 
between the stimuli and the lever responses. 

The Os in Group C differed from the other 
two groups in that they were permitted to use 
only two verbal responses, “Yes” and “No.” 
During the pretraining series that all groups 
were given, the middle-sized stimulus was presented
20 times. During the experimental
sessions, Os in Group C were asked to report 
“Yes” if the stimulus presented was the same 
size as the training stimulus and to report “No”
if it was larger or smaller. The movement 
response for this group was the same as for Os 
in Group A, but no “second guess” verbal 
response was permitted them. 

Materials.—The O was seated across a table 
from E. A wooden partition separated and 
obscured E from O. Slightly to the left of O 
and approximately at eye level was an 8 X 8 in. 
aperture containing a milk-glass screen on 
which the size stimuli were presented. Pro- 
jecting from the table top directly in front of 
O’s right shoulder was an 8-in. knobbed lever 
which could be moved through 60 degrees of 
arc from O’s left to right. This gave a total 
length of 8.38 in. or 21.28 cm. Through a 
system of strings and pulleys, the movement 
of the lever was recorded on a kymograph on 
E’s side of the partition. The arrangement of 
the table and the positioning of the lever were 
such that O was unable to support his elbow 
while positioning the lever. For Group B, the 
lever was covered with a cardboard shield so 
that Os had to reach their hands through a cloth 
curtain in order to position the lever. 
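The stated length of the lever's path follows from the arc-length relation (radius times angle in radians). The short check below assumes that the knob travels on a circle whose radius is the full 8-in. length of the lever; the function name `arc_length` is hypothetical.

```python
from math import radians


def arc_length(radius, degrees):
    """Length of a circular arc of the given radius and angle in degrees."""
    return radius * radians(degrees)


arc_in = arc_length(8.0, 60.0)  # inches travelled by the knob
arc_cm = arc_in * 2.54          # the same distance in centimeters
print(round(arc_in, 2), round(arc_cm, 2))
```

Under that assumption the result agrees with the reported 8.38 in. or 21.28 cm.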

The size stimuli were constructed from 
squares of dark gray paper photographed onto 
35-mm. slides. They were projected onto the
milk-glass screen. The projected size of the 
smallest square was 4 mm. square, and the 
stimuli increased in size by steps of 2 mm. per 
side so that the largest square was 24 mm.
square. 

Procedure.—All Os were run individually. 
Prior to beginning the experimental judgments, 
Os were given a preliminary practice session 
which consisted of 20 presentations of the mid- 
sized stimulus during which Os practiced posi- 
tioning the lever to the midposition. Following 
this practice, each of the 11 size stimuli was 
presented four times in a random sequence with 
the restriction that each size would occur twice 
in the first 22 presentations. The lever posi- 
tioning response always began with the lever in 
the extreme left position, and O was required 
to move it toward his right a distance propor- 
tionate to the size of the stimulus. He was 
further instructed to make only a slight move- 
ment for the smallest stimulus and to make the 
movement for the largest stimulus very near to 
the maximum. 
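One way to produce a presentation order meeting the restriction described above is to shuffle the two halves of the series separately. The source does not say how the sequence was actually drawn, so the following is only a sketch; the function name `presentation_order` and the numbering of the stimuli from 1 to 11 are assumptions made for illustration.

```python
import random


def presentation_order(n_stimuli=11, seed=None):
    """Return 4 * n_stimuli stimulus numbers, each occurring twice per half."""
    rng = random.Random(seed)
    order = []
    for _half in range(2):  # two halves of 22 presentations each
        block = [s for s in range(1, n_stimuli + 1) for _copy in range(2)]
        rng.shuffle(block)  # random order within the half
        order.extend(block)
    return order
```

Because each size occurs four times in all and twice in the first 22 presentations, it necessarily occurs twice in the last 22 as well, which is why both halves are built in the same way.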

Subjects.—A total of 31 Os were run, 10 in 
Group A, 12 in Group B, and 9 in Group C. 
All were male undergraduate students drawn 
from the elementary psychology course at The 
Johns Hopkins University. They were randomly
assigned to the three experimental groups.


TABLE 1

AVERAGE INTERCORRELATION BETWEEN STIMULI
AND RESPONSE SYSTEMS AND AMONG
RESPONSE SYSTEMS FOR GROUPS
A AND B

Variables     Group A

r_SV1          .957
r_SV2          88
r_SL           JIL
r_V1V2         .939
r_V1L          BS
r_V2L          .933

* With 19 df, t values of 2.09 and 2.86 are significant
at the .05 and the .01 level, respectively.


RESULTS 


The first step in the analysis of the 
data was to determine the average 
intercorrelation between the stimuli 
and the different response systems for 
each of the experimental groups. For 
each O the product-moment corre- 
lation between the stimuli (S) and 
the first verbal response (V_1), the
second verbal response (V_2), and the
lever response (L) was computed, as
well as the intercorrelations among 
these three response systems. These 
correlation coefficients obtained from 
the individual Os were then trans- 
formed to Fisher’s z and the average 
z value computed by experimental 
groups for each of the six stimulus- 
response and response-response cor- 
relations. The significance of differ- 
ences between Groups A and B were 
evaluated by computing the variance 
of the obtained z distributions and 
using a ¢ test for the differences be- 
tween the mean z values. The re- 
sulting values of t are shown in Table 
1 along with the average intercorre- 


lations for Groups A and B.* The 


* Group C is not represented in this table since 
the only product-moment correlation that could 
be computed for Os in this group was that 
between the stimuli and the lever response. 
The average value of this correlation was .949. 


average intercorrelations in this table 
were obtained by reconverting the 
mean 2z values into correlation 
coefficients. 
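The averaging procedure described above can be sketched in a few lines. This is only an illustration under stated assumptions: the per-observer correlations below are hypothetical, and the pooled-variance form of the t test is assumed, since the text does not say which form was used.

```python
import math

def fisher_z(r):
    """Fisher's r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)

def mean_correlation(rs):
    """Average correlations by averaging their z values, then converting back to r."""
    zs = [fisher_z(r) for r in rs]
    return math.tanh(sum(zs) / len(zs))

def pooled_t(zs_a, zs_b):
    """Two-sample t for the difference between mean z values (pooled variance assumed)."""
    na, nb = len(zs_a), len(zs_b)
    ma, mb = sum(zs_a) / na, sum(zs_b) / nb
    ss_a = sum((z - ma) ** 2 for z in zs_a)
    ss_b = sum((z - mb) ** 2 for z in zs_b)
    df = na + nb - 2
    pooled_var = (ss_a + ss_b) / df
    t = (ma - mb) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df

# Hypothetical per-observer correlations, NOT data from the study.
group_a = [0.95, 0.96, 0.94, 0.97]
group_b = [0.91, 0.93, 0.90, 0.92]

mean_a = mean_correlation(group_a)
mean_b = mean_correlation(group_b)
t_value, df = pooled_t([fisher_z(r) for r in group_a],
                       [fisher_z(r) for r in group_b])
```

Averaging in z units rather than averaging the raw coefficients matters most when the correlations are close to 1, as they are here, because the sampling distribution of r is strongly skewed in that region.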

As can be seen from the table, the average intercorrelation between
the stimuli and the first and second verbal responses (r_SV1 and
r_SV2) does not differ significantly between Groups A and B.
However, the correlation between the stimuli and the lever
responses (r_SL) is significantly larger in Group A, as is also the
correlation between the lever and the first verbal responses
(r_V1L). In Group B, on the other hand, the correlation between the
two verbal responses (r_V1V2) tends to be higher although this
difference is not quite significant at the .05 level.

These group differences in average correlation are for the most
part to be expected due to the greater error in the lever responses
of Group B. Since Os in this group were required to position the
lever without the advantage of visual monitoring of the response,
the precision of their settings was less than for Group A. This
greater error in the lever response would be expected to reduce the
correlation of this response with the stimuli as well as with the
other responses. This interpretation is further supported when we
examine the average correlation between the stimuli and the lever
response for Group C. Like Group A, the Os in Group C were allowed
to visually monitor their lever responses, although their verbal
responses were limited to “Yes” and “No.” The average correlation
between the stimuli and the lever responses for this group was
.949, which is significantly larger than the corresponding value of
.917 obtained for Group B (t = 2.97 with 20 df) although not
significantly nor appreciably different from the value for Group A.

When differences within experi- 
mental groups are examined, it is 
found that in both Groups A and B 
the first verbal response system shows 
the highest average correlation with 
the stimuli. In Group A the lever 
responses have a higher correlation 
with the stimuli than do the second 
verbal responses, but this difference 
is reversed in Group B. These dif- 
ferences in magnitude of average 
correlation between the stimuli and 
the three response systems are all 
significant beyond the .05 level for 
Group B when evaluated by the sign 
test, but in Group A only the difference 
between the intercorrelation of the 
first and second verbal responses with 
the stimuli is significant at this level.⁴

While the above analysis shows that all three of the response
systems are highly correlated with the stimuli, the major interest
of this study is the use that Os make of the additional responses
or judgments that are permitted them. For example, if O makes an
error on his first verbal judgment, do both of the other responses
reflect this error or do they tend to correct for it?
Alternatively stated, does each of the response systems provide
information about which of the stimuli has occurred over and above
the information provided by either or both of the other response
systems? These questions can be answered by examining the
correlations between the stimuli and a response system with either
or both of the remaining response systems partialed out.

⁴ Since the z values representing the intercorrelations for the
different Os within experimental groups are intercorrelated in turn
in a highly complex way, the significance of the differences
between the correlations of the different response systems with the
stimuli could not be evaluated by the usual t test for correlated
means. Instead, the sign test was applied by determining the number
of Os in each group for whom the correlation being tested was
higher than the correlation with which it was being compared. Thus,
for example, in Group B, r_SV1 was greater than r_SL for each of
the 12 Os in the group (P < .01).
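The sign test used for the within-group comparisons can be checked with a short exact calculation. The sketch below assumes a two-sided test, which the text does not specify; the function name and its arguments are illustrative.

```python
from math import comb

def sign_test_p(n_positive, n, two_sided=True):
    """Exact binomial sign-test probability under a 50/50 null hypothesis."""
    k = max(n_positive, n - n_positive)
    # Probability of a split at least as extreme as k out of n in one direction.
    tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail) if two_sided else tail

# All 12 Os in a group showing the same direction of difference.
p_all_twelve = sign_test_p(12, 12)
```

With all 12 observers agreeing in direction, the probability is 2/4096, which is well below .01 even on the two-sided assumption.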

The partial correlation between the 
stimuli and each of the three response 
systems was computed for each O in 
Groups A and B. The obtained 
partial correlations were then trans- 
formed to Fisher’s z and averaged by 
experimental groups. To determine 
whether a statistically reliable partial 
correlation existed within an experi- 
mental group, the variance of the 
obtained z distribution for this partial 
correlation was computed and a t test 
applied to the average z value to 
determine whether it was significantly 
greater than zero. Between-group 
differences were also evaluated in this 
manner, but here the t test was applied 
to the difference between mean z 
values for the two groups. 
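For readers who want the computation spelled out, the first-order and second-order partial correlations can be obtained from the six zero-order correlations for an observer. The sketch below uses the standard recursion formula; the correlation values and the helper names are hypothetical and are not taken from the study.

```python
import math

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation r_xy.z from three zero-order correlations."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

def second_order_partial_r(r, x, y, z, w):
    """Second-order partial r_xy.zw, built recursively from first-order partials.

    `r` maps frozenset({a, b}) to the zero-order correlation between a and b.
    """
    def first(a, b, c):
        return partial_r(r[frozenset((a, b))],
                         r[frozenset((a, c))],
                         r[frozenset((b, c))])
    return partial_r(first(x, y, z), first(x, w, z), first(y, w, z))

# Hypothetical correlations for one observer, NOT values from the study.
r = {
    frozenset(("S", "V1")): 0.95,
    frozenset(("S", "V2")): 0.90,
    frozenset(("S", "L")): 0.93,
    frozenset(("V1", "V2")): 0.92,
    frozenset(("V1", "L")): 0.91,
    frozenset(("V2", "L")): 0.88,
}

# Stimulus with first verbal response, second verbal response held constant.
r_sv1_v2 = partial_r(r[frozenset(("S", "V1"))],
                     r[frozenset(("S", "V2"))],
                     r[frozenset(("V1", "V2"))])
# Stimulus with lever response, both verbal responses held constant.
r_sl_v1v2 = second_order_partial_r(r, "S", "L", "V1", "V2")
```

Each partial correlation obtained this way would then be transformed to Fisher's z before averaging across observers, as the paragraph above describes.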

Table 2 summarizes the results of these analyses. The average
partial correlations shown in the table were obtained by
transforming the corresponding mean z values back to correlation
coefficients, and the asterisks are used to designate those average
partial correlations that are significantly greater than zero. The
significance of the between-group differences is indicated by the
accompanying values of t.


TABLE 2

Average Partial Correlations Between Stimuli and Response Systems
for Groups A and B

Variables: r_SV1.V2, r_SV1.L, r_SV2.V1, r_SV2.L, r_SL.V1, r_SL.V2,
r_SV1.V2L, r_SV2.V1L, r_SL.V1V2
[Entries for Groups A and B and the values of t are illegible.]

* Significantly different from zero at the .05 level.
** Significantly different from zero at the .01 level.

For Group A all nine of the partial correlations have an average
value significantly greater than zero, indicating that each of the
three response systems has a relationship with the stimuli
independent of either or both of the other two response systems.
For Group B, however, only the first and the second verbal
responses have an independent relationship with the stimuli. The
failure of the lever responses to show an independent relationship
with the stimuli in this group can be attributed to the greater
error in this response system arising from the lack of visual
monitoring of these responses. This greater error in the lever
responses would also seem to account for the differences in average
partial correlation between the two groups.


DISCUSSION


The above results can be efficiently summarized and discussed in
terms of a schematic model of perception that has been proposed
elsewhere by Eriksen (1) and Garner, Hake, and Eriksen (5).
Briefly, the model provides for a distinction between perception as
a hypothetical concept and the responses by which it is inferred.
In keeping with this distinction we can schematize the present
experiment along the following lines:

                                        --> V1
    Stimulus ---> Perceptual Process    --> L
                                        --> V2

Here the three response systems available to Groups A and B are
conceived as essentially independent of each other, each responding
to the perceptual process and each having the opportunity to be
influenced by unique sources of error.

Such a schema provides the necessary and sufficient conditions for
the present experimental results. Considering the different
response systems as independent provides for the result that each
response system has an independent relation to the stimulus or
perceptual process. By further conceiving that each response system
can be uniquely affected by certain variables or sources of error,
provision is made for the significant partial correlations between
the stimuli and the different responses.
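The link between unique error and nonzero partial correlations can be illustrated with a small analytic calculation. The sketch rests on simplifying assumptions that are not part of the original model: the perceptual process is taken to follow a unit-variance stimulus exactly, errors are additive, and the error variances are arbitrary illustrative numbers.

```python
import math

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation r_xy.z from three zero-order correlations."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

def corr_independent(var_e1, var_e2):
    """R1 = S + e1 and R2 = S + e2, with unit-variance S and independent errors."""
    r_s1 = 1 / math.sqrt(1 + var_e1)
    r_s2 = 1 / math.sqrt(1 + var_e2)
    r_12 = r_s1 * r_s2  # the two responses correlate only through S
    return r_s1, r_s2, r_12

def corr_shared(var_e1, var_extra):
    """R1 = S + e1 and R2 = R1 + extra noise, so R2 inherits all of R1's error."""
    r_s1 = 1 / math.sqrt(1 + var_e1)
    r_s2 = 1 / math.sqrt(1 + var_e1 + var_extra)
    r_12 = math.sqrt((1 + var_e1) / (1 + var_e1 + var_extra))
    return r_s1, r_s2, r_12

# Case 1: each response has its own error; R2 still predicts S with R1 held constant.
r_s1, r_s2, r_12 = corr_independent(0.1, 0.2)
independent_partial = partial_r(r_s2, r_s1, r_12)

# Case 2: R2 merely copies R1 with added noise; R2 adds nothing beyond R1.
r_s1, r_s2, r_12 = corr_shared(0.1, 0.2)
shared_partial = partial_r(r_s2, r_s1, r_12)
```

Under these assumptions the partial correlation is clearly positive in the first case and vanishes in the second, which is the pattern the schema is meant to capture.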

In addition to offering an efficient 
conceptualization of the present results, 
this model has the advantage of focusing 
attention upon the variables that will 
lead to noncorrelated errors in the 
different response systems. While 
neither the model nor the experimental 
data require that all the error in a re- 
sponse system be unique to that system, 
the significant partial correlations that 
were obtained do require that each of the 
three response systems have a source of 
error that is not common to either of the 
other two. Some factors leading to 
these noncorrelated errors have already 
been indicated. For example, it has 
been shown that the number of alter- 
native responses in a verbal response 
system will affect the kind and number 
of errors in this system without affecting 
a concurrent autonomic response (2), 
and there is reason to believe that 
unequal habit strengths among verbal 
responses might have a similar effect 
(2, 4). 

In the present study it was found that increasing the error in the
lever response through elimination of visual monitoring did not
affect the precision of the concurrent verbal judgments. While the
correlation between the stimuli and the lever response was
significantly less in Group B, there was no significant difference
in the accuracy with which Groups A and B used the verbal
responses. Similarly, there was no indication that the precision of
the verbal responses had any effect upon the accuracy of the lever
response. The Os in Group C with only two verbal labels to classify
the stimuli were as accurate in their lever responses as the Group
A Os who were allowed to use 11 different verbal labels to classify
the stimuli. However, the possibility cannot be excluded that the
Os in Group C may have made use of a more refined verbal
classification at the covert level.

In general, the problem of errors in a response system raises an
interesting question concerning the number of different responses
within a system that can be used effectively on a discrimination
task. The present study shows that prediction of the stimuli would
be increased if O's second verbal response is used in the
prediction in addition to the first. A possible explanation for
this increased prediction is that the number of alternative
responses in the first verbal system was insufficient for O to
express all the discrimination he was capable of making. It is
reasonable to assume that in some instances when an O in Group A
gave as his first verbal judgment the response “8,” he was
uncertain as to whether the stimulus was actually “7” or “8.” He
had to decide between these two responses. In other instances when
the response “8” was given he had had to decide between “8” and
“9.” The second verbal response thus gave O an opportunity to show
the direction of his uncertainty.

A reasonable conclusion one might 
draw from this analysis is that the 
increased prediction accruing through 
the second verbal response would dis- 
appear if the number of alternative 
responses in the first verbal response 
system was increased. However, this 
conclusion does not fit with experimental 
observation. Eriksen and Hake (3) 
have shown on a very similar judgmental 
task that increasing the number of 
alternative responses in the system does 
not increase stimulus prediction over 
what is obtained when the number of 
different stimuli and the number of 
response alternatives are equal. 

In terms of the model presented above, 
the increased prediction obtained with 
a second verbal response must result 
from noncorrelated errors in the two 
sets of verbal responses. One factor 
that might constitute at least part of 
this noncorrelated error is a difference in 
judgmental task that confronts O at the 
time of the second verbal judgment. 
For the first judgment O must attempt to 
deal with a continuum of size and to 
place the stimulus in one of 11 categories 
along this continuum. However, at 
the time of the second judgment O is 
faced with essentially a two-category 
judgment. He has already located the 
stimulus along the size dimension, and 
his judgment now is in terms of whether 
the stimulus is apt to be larger or smaller 
than his first judgment. This difference 
in judgmental task may be one of the 
factors subsumed under the noncor- 
related error term. 


SUMMARY 


Observers were presented with a size-judg- 
ment task under the method of absolute judg- 
ment. Three experimental groups were used. 
Group A used the number series 1-11 for their 
verbal judgments and concurrently moved a 
lever a distance proportional to the size of the 
presented stimulus. In addition, they were also 
required to give a second-guess verbal judgment 
to each stimulus. Group B was identical to 
Group A with the exception that the lever 
response was made without the benefit of visual 
monitoring. Group C was allowed only two 
verbal response categories and no second 
guesses, but the lever was moved under the 
same conditions as Group A. 

The results showed that the first and second 
verbal responses and the movement response 
each had a significant correlation with the 
stimuli when either or both of the other response 
systems were held constant. These findings are 
interpreted in terms of noncorrelated errors in 
the different response systems, and some of the 
factors contributing to these noncorrelated 
errors are discussed. 


REFERENCES


1. Eriksen, C. W. Subception: fact or artifact? Psychol. Rev.,
   1956, 63, 74-80.

2. Eriksen, C. W. An experimental and theoretical analysis of
   subception. Amer. J. Psychol., 1956, 69, 625-634.

3. Eriksen, C. W., & Hake, H. W. Absolute judgments as a function
   of the stimulus range and number of stimulus and response
   categories. J. exp. Psychol., 1955, 49, 323-332.

4. Eriksen, C. W., & Wechsler, H. Some effects of experimentally
   induced anxiety upon discrimination behavior. J. abnorm. soc.
   Psychol., 1955, 51, 458-463.

5. Garner, W. R., Hake, H. W., & Eriksen, C. W. Operationism and
   the concept of perception. Psychol. Rev., 1956, 63, 149-159.

6. Lazarus, R. S., & McCleary, R. A. Autonomic discrimination
   without awareness: a study of subception. Psychol. Rev., 1951,
   58, 113-122.


(Received April 9, 1956)


Journal of Experimental Psychology
Vol. 53, No. 5, 1957


SUPPLEMENTARY REPORT: RESPONSE AVAILABILITY IN HUMAN 
TRIAL-AND-ERROR LEARNING ¹


CLYDE E. NOBLE 
Louisiana State University 


In a previous experiment (1) the writer demonstrated the influence
of response availability (N_R) upon human performance in a
nonverbal trial-and-error learning situation (2). When N_R was
varied from 4 to 10 (holding task length constant), there was a
progressive increase in the number of trials to mastery. The
present study involves an extension of N_R from 11 to 14.

Method.—All details of the experiment, other than the variable
condition, were the same as those used previously (1). In the four
N_R treatments, the ratios of the number of relevant to available
responses were 4/11, 4/12, 4/13, and 4/14. The incorrect keys (R-)
were:


N_R    Number of R- Key

11     1, 3, 6, 10, 14, 17, 19
12     2, 5, 7, 9, 11, 13, 15, 18
13     1, 3, 5, 7, 10, 13, 15, 17, 19
14     1, 3, 5, 7, 9, 11, 13, 15, 17, 19

Each of the four N_R conditions (n = 48) was 
administered in a mixed factorial design under 
the 24 permutations of four reactions, with two 
Ss appearing in each cell. The Ss were 192 
basic airmen at Lackland Air Force Base 
assigned randomly to the four treatments 
within each permutation. 
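The sample sizes follow directly from the layout of the design, as the short check below shows. The reaction labels are placeholders introduced only for the illustration.

```python
from itertools import permutations

# Placeholder labels for the four reactions; the report does not name them.
reactions = ("R1", "R2", "R3", "R4")
orders = list(permutations(reactions))  # every ordering of the four reactions

n_conditions = 4   # N_R = 11, 12, 13, 14
ss_per_cell = 2    # two Ss in each condition-by-permutation cell

n_per_condition = len(orders) * ss_per_cell
n_total = n_per_condition * n_conditions
```

Four reactions yield 24 orderings, so two Ss per cell give 48 Ss per condition and 192 Ss in all, matching the figures in the text.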

Results.—The results are presented in Fig. 1 along with the data of
the earlier study. The mean number of trials required to attain
each criterion in terms of first choices (1R+) is plotted as a
function of N_R. It appears that response availability has only a
slight retarding effect upon the rate of trial-and-error learning
under the present conditions.

¹ The experimental work was part of the program of the Air Force
Personnel and Training Research Center. Airman 1/C J. B. Downs
collected the data in 1952; the analysis of the data was supported
by the Council on Research of Louisiana State University.

Friedman’s nonparametric analysis of variance test was applied to
the distributions of ranks in mastery scores among the various
conditions. For df = 3, χ² = 7.29, which indicates only marginally
significant differences among the treatment effects
(.05 < P < .10). This result contrasts sharply with the highly
significant differences (P < .001) obtained in the first
experiment (1).
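As a reminder of how the Friedman statistic is formed, the sketch below ranks the treatments within each block and applies the usual formula. The scores are a toy example, not data from the experiment, and the function assumes there are no tied scores within a block.

```python
def friedman_chi_square(scores):
    """Friedman chi-square; rows are blocks, columns are treatments (no ties assumed)."""
    n = len(scores)       # number of blocks
    k = len(scores[0])    # number of treatments
    rank_sums = [0] * k
    for row in scores:
        # Rank the treatments within this block, 1 = smallest score.
        order = sorted(range(k), key=lambda j: row[j])
        for rank, j in enumerate(order, start=1):
            rank_sums[j] += rank
    return 12 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3 * n * (k + 1)

# Toy scores: 3 blocks by 3 treatments, purely illustrative.
toy_scores = [
    [10, 12, 15],
    [9, 14, 11],
    [8, 13, 16],
]
chi_sq = friedman_chi_square(toy_scores)
```

The statistic is referred to the chi-square distribution with k - 1 degrees of freedom, so four treatment conditions give the df = 3 reported above.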


Fig. 1. Difficulty as a function of response availability (N_R),
with the parameter indicating the four levels of proficiency. The
curves are broken at N_R = 10 to denote the experimental
independence of the two sets of data.


Inspection of response probability data re- 
vealed that each acquisition curve was sigmoidal 
and originated at a hypothetical value (1/N_R) 
representing the initial solubility of the problem. 
As before, there was no evidence of a serial 
position effect when errors were plotted over 
choice points. 







A 6 × 4 × 24 analysis of variance was 
performed upon the 1R+ measure during the 
first six five-trial blocks. By restricting the 
possible scores to the middle 80% range, extreme 
skewness and heterogeneity of form and vari- 
ance were minimized. Practice is highly sig- 
nificant (F = 566.21; df = 5/480), but neither 
of the other two factors produces a main effect. 
The only interaction approaching significance is 
that between availability and permutations 
(F = 1.19; df = 69/96; P = .10). 
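The reported degrees of freedom can be reproduced from the design. The sketch assumes a standard mixed layout with practice blocks as the within-Ss factor and availability and permutations as between-Ss factors; the function and key names are illustrative.

```python
def mixed_design_df(n_blocks, n_avail, n_perms, ss_per_cell):
    """Degrees of freedom for a blocks x availability x permutations mixed design."""
    cells = n_avail * n_perms
    between_error = cells * (ss_per_cell - 1)   # Ss within cells
    practice = n_blocks - 1
    return {
        "practice": practice,
        "availability": n_avail - 1,
        "permutations": n_perms - 1,
        "availability_x_permutations": (n_avail - 1) * (n_perms - 1),
        "between_ss_error": between_error,
        "within_ss_error": practice * between_error,
    }

df = mixed_design_df(n_blocks=6, n_avail=4, n_perms=24, ss_per_cell=2)
```

On this assumption the practice effect is tested on 5 and 480 df and the availability-by-permutations interaction on 69 and 96 df, in agreement with the values given.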

The initial effects of the N_R treatments were 
tested by a 4 × 4 analysis of variance of correct 
first choices during Trials 1-4. From the 
evidence above, permutation differences as well 
as practice interactions may be assumed to be 
zero. The effect of trials is again highly signifi- 
cant (F = 7.93; df = 3/564) with no inter- 
action, and the effect of availability is significant 
(F = 3.73; df = 3/188; P < .025). It appears 
that N_R produces real but small differences in 
performance early in training, but these effects 
disappear following the presolution period. 

As may be observed in connection with Fig. 1, the possibility
exists that some factor in the situation (e.g., “middleness”)
favors Ss having 14 available responses. To check this, the
foregoing analysis was extended to Trial 5; by comparison, the
practice variance becomes more important (F = 28.51; df = 4/752)
and availability less so (F = 1.98; df = 3/188). Finally, the
analysis was continued to include Trials 1-7; this resulted in the
removal of all differences except those due to learning, which
confirms the hypothesis that the correlation between difficulty and
N_R is confined to the initial practice period in this experiment.




Discussion.—The absence of marked treatment effects supports the
hypothesis of a decreasing rate of gain in difficulty with
increasing behavior freedom. One could object, however, that the
comparison of differences in the first experiment (1) with those
found in this one is invalid. It might be argued that a χ² derived
from a 7 × 24 matrix cannot be evaluated on the same basis as a χ²
coming from a 4 × 24 matrix. In order to equalize the degrees of
freedom, therefore, significance ratios were calculated separately
for N_R varying from 4 to 7 and 7 to 10. For df = 3, the χ² values
for the two sets of ranks are 30.46 and 8.36 respectively. The
first is more significant (P < .001) than the second
(.02 < P < .05), which in turn is more significant than the χ² of
7.29 (.05 < P < .10) obtained from the present study. These
inequalities are quite in keeping with the hypothesis of a trend
toward smaller differences with increases in N_R.
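The three probability ranges quoted above can be verified from the chi-square distribution with 3 df, whose upper tail has a closed form. The function name below is illustrative; only the standard library is used.

```python
import math

def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variate with 3 degrees of freedom."""
    return (math.erfc(math.sqrt(x / 2))
            + math.sqrt(2 * x / math.pi) * math.exp(-x / 2))

p_first = chi2_sf_df3(30.46)    # N_R from 4 to 7
p_second = chi2_sf_df3(8.36)    # N_R from 7 to 10
p_present = chi2_sf_df3(7.29)   # N_R from 11 to 14
```

The computed probabilities fall in the stated ranges and decrease in significance in the stated order.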

Considering both experiments, therefore, it is 
concluded that in human trial-and-error learning 
a negatively accelerated relationship exists 
between difficulty and N_R, and that the amount 
of over-all curvature varies with degree of 
proficiency. 


REFERENCES 


1. Noble, C. E. Compound trial-and-error learning as a function of
   response availability (N_R). J. exp. Psychol., 1955, 49, 93-96.

2. Noble, C. E., & Farese, F. J. An apparatus for research in human
   selective learning. J. Psychol., 1955, 39, 475-484.


(Received July 7, 1956) 








